(54) Modified phytases

(57) The present invention is directed to a process
for the production of a modified phytase with improved
activity properties, characterized in that the following
steps are effected:

a) the three dimensional structure of the phytase to
be modified and, optionally, of another phytase with
activity properties which are more favorable than
the ones of the phytase to be modified is/are
computer modelled on the basis of the three
dimensional structure of the phytase of Aspergillus niger;

b) the structure of the active sites of the phytase to
be modified and of the phytase with the more
favorable activity properties are compared and those
amino acid residues in both active sites which are
different are identified;

c) a DNA sequence coding for a modified phytase is
constructed by changing the nucleotides coding for
at least one of the amino acids by which both active
sites differ;

d) integrating such a DNA sequence into a vector
capable of expression in a suitable host cell;

e) transforming a suitable host cell by the DNA
sequence of c) or the vector of d), growing said host
cell under suitable growth conditions and isolating
the modified phytase from the host cell or the
culture medium by methods known in the state of the
art.

The present invention is also directed to phytases 
obtainable or obtained by such a process. 
Description

[0001] Phytases (myo-inositol hexakisphosphate phosphohydrolases; EC 3.1.3.8) are enzymes that hydrolyze
phytate (myo-inositol hexakisphosphate) to myo-inositol and inorganic phosphate and are known to be valuable feed
additives.

[0002] A phytase was first described in rice bran in 1907 [Suzuki et al., Bull. Coll. Agr. Tokio Imp. Univ. 7, 495 (1907)]
and phytases from Aspergillus species in 1911 [Dox and Golden, J. Biol. Chem. 10, 183-186 (1911)]. Phytases have
also been found in wheat bran, plant seeds, animal intestines and in microorganisms [Howson and Davis, Enzyme
Microb. Technol. 377-382 (1983), Lambrechts et al., Biotech. Lett. 14, 61-66 (1992), Shieh and Ware, Appl. Microbiol.
16, 1348-1351 (1968)].

[0003] The cloning and expression of the phytase from Aspergillus niger (ficuum) has been described by Van
Hartingsveldt et al., in Gene, 127, 87-94 (1993) and in European Patent Application, Publication No. (EP) 420 358 and from
Aspergillus niger var. awamori by Piddington et al., in Gene 133, 55-62 (1993).

[0004] Cloning, expression and purification of phytases with improved properties have been disclosed in EP 684 313.
However, since there is a still ongoing need for further improved phytases, especially with respect to the activity
properties, it is an object of the present invention to provide the following:

i) A process for the production of a modified phytase with improved activity properties, characterized in that the
following steps are effected:

a) the three dimensional structure of the phytase to be modified and, optionally, of another phytase with activity
properties which are more favorable than the ones of the phytase to be modified is/are computer modelled on
the basis of the three dimensional structure of the phytase of Aspergillus niger (ficuum);

b) the structure of the active sites of the phytase to be modified and of the phytase with the more favorable
activity properties are compared and those amino acid residues in both active sites which are different are
identified;

c) a DNA sequence coding for a modified phytase is constructed by changing the nucleotides coding for at
least one of the amino acids by which both active sites differ;

d) integrating such a DNA sequence into a vector capable of expression in a suitable host cell;

e) transforming a suitable host cell by the DNA sequence of c) or the vector of d), growing said host cell under
suitable growth conditions and isolating the modified phytase from the host cell or the culture medium by
methods known in the state of the art; or

ii) a process as described under i) wherein the phytase to be modified is of eukaryotic, preferably fungal, more
preferably Aspergillus, e.g. Aspergillus fumigatus origin; or

iii) a process as described under i) or ii) wherein the phytase with more favorable activity properties is of eukaryotic,
preferably fungal, more preferably Aspergillus, e.g. Aspergillus niger or Aspergillus terreus (Aspergillus terreus cbs
116.46 or 9A1) origin; or

iv) a process as described under i), ii) or iii) wherein the phytase to be modified is a phytase of Aspergillus
fumigatus and the phytase with the more favorable activity properties is the Aspergillus terreus phytase or the phytase of
Aspergillus niger.
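
Step b) of the process, the comparison of two active sites, can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions: the two aligned sequence fragments and the set of active-site alignment columns are invented placeholders and are not taken from Figure 1, and the function name is hypothetical.

```python
def differing_active_site_positions(seq_a, seq_b, active_site_positions):
    """Return (position, residue_a, residue_b) for active-site columns that differ.

    seq_a and seq_b must already be aligned (same length, '-' for gaps);
    positions are 1-based alignment columns.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    differences = []
    for pos in sorted(active_site_positions):
        res_a, res_b = seq_a[pos - 1], seq_b[pos - 1]
        if res_a != res_b:
            differences.append((pos, res_a, res_b))
    return differences


# Hypothetical aligned fragments (NOT the real phytase sequences).
to_modify = "MKQSAGHDRT"   # phytase to be modified
reference = "MKLSADHDRT"   # phytase with the more favorable properties
active_site = {3, 5, 6, 8}  # hypothetical active-site columns

candidates = differing_active_site_positions(to_modify, reference, active_site)
print(candidates)
```

Each tuple returned names a candidate position for step c), i.e. a residue of the phytase to be modified that could be exchanged for the residue found in the more favorable phytase.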

[0005] In this context it should be mentioned that another possibility of producing phytases with improved properties
is by isolating phytases from the same organism, like for example Aspergillus ficuum, but from different strains which can
be found in nature and have been deposited by any of the known depository authorities. Their amino acid sequences
can be determined by cloning their corresponding DNA sequences by methods as described, e.g. in European Patent
Application No. (EP) 684 313. Once such sequences have been defined they can be modeled on the basis of the
three-dimensional structure of the A. niger phytase and the active sites of both sequences can be compared to find out
whether such phytase should have improved activity properties (see Example 8) or both active site sequences can be
compared directly and then tested for increased and/or improved activity by the assays described in the present
application.

[0006] It is furthermore an object of the present invention to provide a modified phytase which is obtainable by a
process as described above.

[0007] It is in general an object of the present invention to provide a phytase which has been modified in a way that
its activity property is more favorable than the one of the non-modified phytase, specifically such a phytase
characterized in that the amino acid sequence of the non-modified phytase has been changed by deletion, substitution
and/or addition of one or more amino acids, more specifically such a phytase wherein changes have been made at at
least one position which is homologous to one of the following positions of the amino acid sequence of the phytase of
Aspergillus (A.) niger (see Fig. 1): 27, 66, 71, 103, 140, 141, 188, 205, 234, 235, 238, 274, 277, 282, 340 and/or 424,
preferably 27, 66, 140, 205, 274, 277, 282 and/or 340, and even more specifically such a phytase which is the phytase
of eukaryotic, preferably fungal, more preferably Aspergillus and most preferably Aspergillus fumigatus, origin.

[0008] It is furthermore an object of the present invention to provide such a phytase wherein at position 27 or at least
at position 27 a change occurs, preferably a phytase wherein the amino acid at position 27 is replaced by one selected
from one of the following groups:

a) Ala, Val, Leu, Ile; or
b) Thr; or
c) Asn;

and furthermore such a phytase wherein in addition to position 27 a change occurs also at position 66 or
wherein in addition to position 27 a change occurs also at position 140 and/or at positions 274 and/or 277.

[0009] It is also an object of the present invention to provide a phytase as specified above which is characterized by
at least one of the following mutations: Q27L, Q27N, Q27T, Q27I, Q27V, Q27A, Q27G, S66D, S140Y, D141G, A205E,
Q274L, G277D, G277K, Y282H and/or N340S.

[0010] It is furthermore an object of the present invention to provide phytase muteins which are resistant against
degradation by proteases of fungal, preferably Aspergillus and most preferably Aspergillus niger (ficuum) origin. Such
muteins are characterized in that at least at one of the following positions (which refers to the homologous position
in the amino acid sequence of A. niger), namely position 130 or 129 and 130, preferably of the Aspergillus fumigatus or
167, 168 preferably of the A. nidulans phytase amino acid sequence, the amino acid which is present in the wild type
sequence has been replaced by another amino acid which is known to change the protease sensitivity, e.g. in the
case of A. fumigatus at position 130 from "S" to "N" and at position 129 from "R" to "L" and in case of A. nidulans at
position 167 from "K" to "G" and at position 168 from "R" to "Q". Such positions can also be combined with those providing
for improved activity properties.

[0011] In this context "improved activity property" means any type of improvement of the activity of the mutated
phytase as compared to the non mutated. This could mean for example a higher specific activity, preferably at least two
fold or more preferably at least 3 to 4 fold higher in an assay known in the state of the art to measure phytase activity,
see e.g. in EP 684 313 or described in the examples of the present application. Furthermore this could mean a different
substrate specificity determined in an assay known in the state of the art or as described e.g. in the specific examples
of the present invention. This could also mean a maximum of the specific activity at a different more favorable pH or a
broad pH optimum ("improved pH profile") determined by an assay as known in the state of the art or as described e.g.
in the examples. Finally this could also mean any combination of such properties.
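
How two of these criteria might be checked numerically is sketched below in Python. The activity values are invented for illustration only and are not measured data; the function names are hypothetical, and both profiles are assumed to come from the same assay and to be expressed in the same units.

```python
def fold_improvement(mutant_activity, wild_type_activity):
    """Ratio of mutant to wild-type specific activity (same units, same assay)."""
    if wild_type_activity <= 0:
        raise ValueError("wild-type activity must be positive")
    return mutant_activity / wild_type_activity


def ph_optimum(profile):
    """Return the pH with the highest specific activity in a {pH: activity} map."""
    return max(profile, key=profile.get)


def normalize(profile):
    """Scale a pH profile so that its maximum activity equals 1.0."""
    peak = max(profile.values())
    return {ph: activity / peak for ph, activity in profile.items()}


# Invented numbers for illustration only (units per mg protein); not measured data.
wild_type = {4.0: 20.0, 5.0: 25.0, 6.0: 30.0, 7.0: 15.0}
mutant = {4.0: 60.0, 5.0: 90.0, 6.0: 75.0, 7.0: 30.0}

fold_at_ph5 = fold_improvement(mutant[5.0], wild_type[5.0])
print(fold_at_ph5, ph_optimum(wild_type), ph_optimum(mutant))
```

With these placeholder values the mutant would satisfy the "at least 3 to 4 fold" criterion at pH 5.0 and would show a shifted pH optimum; the normalized profile is the form in which pH curves with a maximum activity of 1.0 are plotted.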

[0012] "Homologous" in the context of the present invention means the best fit of the primary, preferably also
secondary and most preferably also tertiary structure of the phytase to be modified and the phytase of Aspergillus niger. How
such best fit can be obtained is described in detail in Example 1 of the present invention. Figure 1 gives an example of
such best fit for the phytase amino acid sequences of Aspergillus fumigatus and Aspergillus terreus aligned on the basis
of the Aspergillus niger amino acid sequence, which latter sequence is also used as the reference to which the positions
of the other sequences, e.g. the ones named before, are referred. Furthermore the modified Aspergillus fumigatus
phytase with the Q27L mutation means nothing else than the phytase of Aspergillus fumigatus wherein at position 27
according to the assignment as defined above (which is in fact position 23 of the Aspergillus fumigatus amino acid
sequence) the naturally occurring glutamine ("Q" refers to the standard IUPAC one letter amino acid code) has been
replaced by leucine ("L"). All muteins of the present invention are designated in this way independent of whether they
are protease resistant muteins or muteins with improved activity properties.
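
The naming convention can be made concrete with a short Python sketch. It parses a label such as Q27L and applies it to a sequence through a mapping from reference (A. niger) positions to native positions. The toy sequence is an invented placeholder, not the A. fumigatus sequence; only the correspondence 27 to 23 is taken from the paragraph above, and the function names are hypothetical.

```python
import re

MUTATION_PATTERN = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutation(label):
    """Split a label such as 'Q27L' into (wild_type, reference_position, replacement)."""
    match = MUTATION_PATTERN.match(label)
    if match is None:
        raise ValueError(f"not a valid mutation label: {label!r}")
    wild_type, position, replacement = match.groups()
    return wild_type, int(position), replacement


def apply_mutation(sequence, label, reference_to_native):
    """Apply a mutation given in reference numbering to a native sequence.

    reference_to_native maps reference (A. niger) positions to 1-based
    positions in the sequence being modified, as read off an alignment.
    """
    wild_type, ref_pos, replacement = parse_mutation(label)
    native_pos = reference_to_native[ref_pos]
    found = sequence[native_pos - 1]
    if found != wild_type:
        raise ValueError(
            f"expected {wild_type} at native position {native_pos}, found {found}"
        )
    return sequence[: native_pos - 1] + replacement + sequence[native_pos:]


# Toy sequence with Q at native position 23 (placeholder, not a real phytase).
toy_sequence = "A" * 22 + "Q" + "A" * 7
mutated = apply_mutation(toy_sequence, "Q27L", {27: 23})
print(parse_mutation("Q27L"), mutated[22])
```

Checking the wild-type residue before substituting guards against applying a label to the wrong numbering scheme, which is the main risk when reference and native positions differ.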

[0013] It is furthermore an object of the present invention to provide a DNA sequence comprising a DNA sequence
coding for a phytase as described above, a vector, preferably an expression vector, comprising such a DNA sequence,
a host cell which has been transformed by such a DNA sequence or vector, a process for the preparation of a phytase
of the present invention wherein the host cell as described before is cultured under suitable culture conditions and the
phytase is isolated from such host cell or the culture medium by methods known in the art, and a food or feed
composition comprising a phytase of the present invention.

[0014] In this context it should be noted that it is also an object of the present invention to provide a DNA sequence
which codes for a phytase carrying at least one of the specific mutations of the present invention and which hybridizes
under standard conditions with the DNA sequences of the specific modified phytases of the present invention or a DNA
sequence which, because of the degeneracy of the genetic code, does not hybridize but which codes for a polypeptide
with exactly the same amino acid sequence as the one encoded by the DNA sequence to which it does not hybridize
or a DNA sequence which is a fragment of such DNA sequences which maintains the activity properties of the
polypeptide of which it is a fragment.
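
The degeneracy argument can be illustrated with a short Python sketch: two DNA fragments that differ at many nucleotide positions can still encode the same amino acid sequence. The two fragments are invented for illustration and are not phytase sequences; the codon table is the standard genetic code, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA string (length divisible by 3) into one-letter code."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("length of coding sequence must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def count_mismatches(seq_a, seq_b):
    """Number of positions at which two equally long sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(seq_a, seq_b))


# Two invented coding fragments (not phytase sequences).
fragment_1 = "CAGCTGTCTGGT"
fragment_2 = "CAATTAAGCGGC"

print(translate(fragment_1), translate(fragment_2))
print(count_mismatches(fragment_1, fragment_2))
```

Whether two such sequences hybridize depends on the conditions used; the sketch only shows that nucleotide divergence and an identical encoded polypeptide are compatible.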

[0015] "Standard conditions" for hybridization mean in this context the conditions which are generally used by a man
skilled in the art to detect specific hybridization signals and which are described, e.g. by Sambrook et al., "Molecular
Cloning", second edition, Cold Spring Harbor Laboratory Press 1989, New York, or preferably so called stringent
hybridization and non-stringent washing conditions or more preferably so called stringent hybridization and stringent washing
conditions a man skilled in the art is familiar with and which are described, e.g. in Sambrook et al. (s.a.).

[0016] It is furthermore an object of the present invention to provide a DNA sequence which can be obtained by the
so called polymerase chain reaction method ("PCR") by PCR primers designed on the basis of the specifically
described DNA sequences of the present invention. It is understood that the so obtained DNA sequences code for
phytases with at least the same mutation as the ones from which they are designed and show comparable activity
properties.

[0017] The principles of the polymerase chain reaction (PCR) method are outlined e.g. by White et al., Trends in
Genetics, 5, 185-189 (1989), whereas improved methods are described e.g. in Innis et al. [PCR Protocols: A guide to
Methods and Applications, Academic Press, Inc. (1990)].
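
A minimal Python sketch of how a primer pair can be derived from a known sequence is given below. It is an illustration only: the template is an invented placeholder, the primer length is arbitrary, and the melting temperature uses the simple 2 °C per A/T plus 4 °C per G/C rule of thumb for short oligonucleotides, which is an assumption of this sketch and not a method taken from the references above.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string, written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def design_primer_pair(template, length=18):
    """Forward primer = 5' end of the template; reverse primer = reverse
    complement of its 3' end, so that both primers read 5' to 3'."""
    template = template.upper()
    if len(template) < 2 * length:
        raise ValueError("template too short for two non-overlapping primers")
    forward = template[:length]
    reverse = reverse_complement(template[-length:])
    return forward, reverse


def rule_of_thumb_tm(primer):
    """Rough melting temperature in degrees C: 2 per A/T plus 4 per G/C."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


# Invented 40-base template (not a phytase sequence).
template = "ATGACCGGTCTGAAAGCTTTCGATCCGGAATTCTGGCTAA"
forward, reverse = design_primer_pair(template, length=10)
print(forward, reverse, rule_of_thumb_tm(forward))
```

In practice primers are longer and are checked for secondary structure and specificity; the sketch only shows the strand orientation of the two primers relative to the template.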

[0018] DNA sequences of the present invention can be constructed starting from genomic or cDNA sequences coding
for phytases known in the state of the art [for sequence information see references mentioned above, e.g. EP 684 313
or sequence data bases, for example like Genbank (Intelligenetics, California, USA), European Bioinformatics Institute
(Hinxton Hall, Cambridge, GB), NBRF (Georgetown University, Medical Centre, Washington DC, USA) and Vecbase
(University of Wisconsin, Biotechnology Centre, Madison, Wisconsin, USA)] or disclosed in the figures by methods of in
vitro mutagenesis [see e.g. Sambrook et al., Molecular Cloning, Cold Spring Harbor Laboratory Press, New York]. A
widely used strategy for such "site directed mutagenesis", as originally outlined by Hutchison and Edgell [J. Virol. 8,
181 (1971)], involves the annealing of a synthetic oligonucleotide carrying the desired nucleotide substitution to a target
region of a single-stranded DNA sequence wherein the mutation should be introduced [for review see Smith, Annu. Rev.
Genet. 19, 423 (1985) and for improved methods see references 2-6 in Stanssen et al., Nucl. Acid Res., 17, 4441-4454
(1989)]. Another possibility of mutating a given DNA sequence which is also preferred for the practice of the present
invention is the mutagenesis by using the polymerase chain reaction (PCR). DNA as starting material can be isolated
by methods known in the art and described e.g. in Sambrook et al. (Molecular Cloning) from the respective strains. For
strain information see e.g. EP 684 313 or any depository authority indicated below. Aspergillus niger [ATCC 9142],
Myceliophthora thermophila [ATCC 48102], Talaromyces thermophilus [ATCC 20186] and Aspergillus fumigatus [ATCC
34625] have been redeposited on March 14, 1997 according to the conditions of the Budapest Treaty at the American
Type Culture Cell Collection under the following accession numbers: ATCC 74337, ATCC 74340, ATCC 74338 and
ATCC 74339, respectively. It is, however, understood that DNA encoding a phytase to be mutated in accordance with
the present invention can also be prepared on the basis of a known DNA sequence, e.g. as shown in Fig. 6, in a
synthetic manner and described e.g. in EP 747 483 by methods known in the art.
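
The oligonucleotide-based strategy can be sketched in Python as follows. The example exchanges one codon in a coding sequence and cuts out an oligonucleotide that carries the substitution flanked by unchanged bases. The coding fragment, the flank length and the function name are assumptions made for illustration; the fragment is not taken from Fig. 6.

```python
def mutagenic_oligo(coding_dna, codon_index, new_codon, flank=9):
    """Replace the codon at 1-based codon_index and return
    (mutated_sequence, oligo), where the oligo carries the new codon with up
    to `flank` unchanged bases on each side (fewer at the sequence ends)."""
    coding_dna = coding_dna.upper()
    new_codon = new_codon.upper()
    if len(new_codon) != 3:
        raise ValueError("a codon has exactly three nucleotides")
    start = (codon_index - 1) * 3
    if codon_index < 1 or start + 3 > len(coding_dna):
        raise ValueError("codon index outside the coding sequence")
    mutated = coding_dna[:start] + new_codon + coding_dna[start + 3:]
    left = max(0, start - flank)
    right = min(len(mutated), start + 3 + flank)
    return mutated, mutated[left:right]


# Invented fragment: ATG GCT TCT CAG GGT CAT CCG (M A S Q G H P).
fragment = "ATGGCTTCTCAGGGTCATCCG"

# Exchange codon 4 (CAG, glutamine) for CTG (leucine), i.e. a Q-to-L change.
mutated, oligo = mutagenic_oligo(fragment, codon_index=4, new_codon="CTG", flank=6)
print(mutated)
print(oligo)
```

The oligonucleotide is written in the sense orientation here; for annealing to a single-stranded template of the same sense, its reverse complement would be used.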

[0019] Once complete DNA sequences of the present invention have been obtained they can be integrated into
vectors by methods known in the art and described e.g. in Sambrook et al. (s.a.) to overexpress the encoded polypeptide
in appropriate host systems. However, a man skilled in the art knows that also the DNA sequences themselves can be
used to transform the suitable host systems of the invention to get overexpression of the encoded polypeptide.
Appropriate host systems are for example fungi, like Aspergilli, e.g. Aspergillus niger [ATCC 9142] or Aspergillus ficuum
[NRRL 3135] or like Trichoderma, e.g. Trichoderma reesei or yeasts, like Saccharomyces, e.g. Saccharomyces
cerevisiae or Pichia, like Pichia pastoris, or Hansenula polymorpha, e.g. H. polymorpha (DSM5215). A man skilled in the art
knows that such microorganisms are available from depository authorities, e.g. the American Type Culture Collection
(ATCC), the Centraalbureau voor Schimmelcultures (CBS) or the Deutsche Sammlung für Mikroorganismen und
Zellkulturen GmbH (DSM) or any other depository authority as listed in the Journal "Industrial Property" [(1991) 1, pages
29-40]. Bacteria which can be used are e.g. E. coli, Bacilli as, e.g. Bacillus subtilis or Streptomyces, e.g. Streptomyces
lividans (see e.g. Anné and Mallaert in FEMS Microbiol. Letters 114, 121 (1993)). E. coli which could be used are E.
coli K12 strains e.g. M15 [described as DZ 291 by Villarejo et al. in J. Bacteriol. 120, 466-474 (1974)], HB 101 [ATCC
No. 33694] or E. coli SG13009 [Gottesman et al., J. Bacteriol. 148, 265-273 (1981)].

[0020] Vectors which can be used for expression in fungi are known in the art and described e.g. in EP 420 358, or
by Cullen et al. [Bio/Technology 5, 369-376 (1987)] or Ward in Molecular Industrial Mycology, Systems and Applications
for Filamentous Fungi, Marcel Dekker, New York (1991), Upshall et al. [Bio/Technology 5, 1301-1304 (1987)], Gwynne
et al. [Bio/Technology 5, 71-79 (1987)], Punt et al. [J. Biotechnol. 17, 19-34 (1991)] and for yeast by Sreekrishna et al.
[J. Basic Microbiol. 28, 265-278 (1988), Biochemistry 28, 4117-4125 (1989)], Hitzemann et al. [Nature 293, 717-722
(1981)] or in EP 183 070, EP 183 071, EP 248 227, EP 263 311. Suitable vectors which can be used for expression in
E. coli are mentioned, e.g. by Sambrook et al. [s.a.] or by Fiers et al. in "Procd. 8th Int. Biotechnology Symposium" [Soc.
Franc. de Microbiol., Paris (Durand et al., eds.), pp. 680-697 (1988)] or by Bujard et al. in Methods in Enzymology, eds.
Wu and Grossmann, Academic Press, Inc. Vol. 155, 416-433 (1987) and Stüber et al. in Immunological Methods, eds.
Lefkovits and Pernis, Academic Press, Inc., Vol. IV, 121-152 (1990). Vectors which could be used for expression in
Bacilli are known in the art and described, e.g. in EP 405 370, Procd. Natl. Acad. Sci. USA 81, 439 (1984) by Yansura
and Henner, Meth. Enzymol. 185, 199-228 (1990) or EP 207 459. Vectors which can be used for the expression in H.
polymorpha are known in the art and described, e.g. in Gellissen et al., Biotechnology 291-295 (1991).
[0021] Either such vectors already carry regulatory elements, e.g. promotors, or the DNA sequences of the present
invention can be engineered to contain such elements. Suitable promotor elements which can be used are known in the
art and are, e.g. for Trichoderma reesei the cbh1- [Haarki et al., Biotechnology 7, 596-600 (1989)] or the pki1-promotor
[Schindler et al., Gene 271-275 (1993)], for Aspergillus oryzae the amy-promotor [Christensen et al., Abstr. 19th
Lunteren Lectures on Molecular Genetics F23 (1987), Christensen et al., Biotechnology 6, 1419-1422 (1988), Tada et
al., Mol. Gen. Genet. 229, 301 (1991)], for Aspergillus niger the glaA- [Cullen et al., Bio/Technology 5, 369-376 (1987),
Gwynne et al., Bio/Technology 5, 713-719 (1987), Ward in Molecular Industrial Mycology, Systems and Applications for
Filamentous Fungi, Marcel Dekker, New York, 83-106 (1991)], alcA- [Gwynne et al., Bio/Technology 5, 718-719 (1987)],
suc1- [Boddy et al., Curr. Genet. 24, 60-66 (1993)], aphA- [MacRae et al., Gene 71, 339-348 (1988), MacRae et al.,
Gene 132, 193-198 (1993)], tpiA- [McKnight et al., Cell 46, 143-147 (1986), Upshall et al., Bio/Technology 5, 1301-1304
(1987)], gpdA- [Punt et al., Gene 69, 49-57 (1988), Punt et al., J. Biotechnol. 17, 19-37 (1991)] and the pkiA-promotor
[de Graaff et al., Curr. Genet. 22, 21-27 (1992)]. Suitable promotor elements which could be used for expression in
yeast are known in the art and are, e.g. the pho5-promotor [Vogel et al., Mol. Cell. Biol., 2050-2057 (1989); Rudolf and
Hinnen, Proc. Natl. Acad. Sci. 84, 1340-1344 (1987)] or the gap-promotor for expression in Saccharomyces cerevisiae
and for Pichia pastoris, e.g. the aox1-promotor [Koutz et al., Yeast 5, 167-177 (1989); Sreekrishna et al., J. Basic
Microbiol. 28, 265-278 (1988)], or the FMD promoter [Hollenberg et al., EPA No. 0299108] or MOX-promotor [Ledeboer et al.,
Nucleic Acids Res. 13, 3063-3082 (1985)] for H. polymorpha.

[0022] Accordingly vectors comprising DNA sequences of the present invention, preferably for the expression of said
DNA sequences in bacteria or a fungal or a yeast host and such transformed bacteria or fungal or yeast hosts are also
an object of the present invention.

[0023] Once such DNA sequences have been expressed in an appropriate host cell in a suitable medium the encoded
phytase can be isolated either from the medium in the case the phytase is secreted into the medium or from the host
organism in case such phytase is present intracellularly by methods known in the art of protein purification or described,
e.g. in EP 420 358. Accordingly a process for the preparation of a polypeptide of the present invention characterized in
that transformed bacteria or a host cell as described above is cultured under suitable culture conditions and the
polypeptide is recovered therefrom and a polypeptide when produced by such a process or a polypeptide encoded by
a DNA sequence of the present invention are also an object of the present invention.

[0024] Phytases of the present invention can be also expressed in plants according to methods as described, e.g. by
Pen et al. in Bio/Technology 11, 811-814 (1994) or in EP 449 375, preferably in seeds as described, e.g. in EP 449 376.
[0025] For example, a DNA sequence encoding a phytase of the present invention can be placed under the control
of regulatory sequences from the gene encoding the 12S storage protein cruciferin from Brassica napus. The construct
is thereafter subcloned into a binary vector such as pMOG23 (in E. coli K-12 strain DH5α, deposited at the Centraal
Bureau voor Schimmelcultures, Baarn, The Netherlands under accession number CBS 102.90). This vector is
introduced into Agrobacterium tumefaciens which contains a disarmed Ti plasmid. Bacterial cells containing this construct
are co-cultivated with tissues from tobacco or Brassica plants, and transformed plant cells are selected by nutrient
media containing antibiotics and induced to regenerate into differentiated plants on such media. The resulting plants
will produce seeds that contain and express the DNA construct. Alternatively the phytase-encoding DNA sequence can be
placed under the control of regulatory sequences from the 35S promoter of Cauliflower Mosaic Virus (CaMV). The construct is
thereafter subcloned into a binary vector. This vector is then introduced into Agrobacterium tumefaciens which contains
a disarmed Ti plasmid. Bacterial cells containing this construct are co-cultivated with tissues from tobacco or Brassica
plants, and transformed plant cells are selected by nutrient media containing antibiotics and induced to regenerate into
differentiated plants on such media. The resulting plants contain and express the DNA construct constitutively.
[0026] The plant or plant part containing phytase can be used directly for the preparation of a feed composition, or the
phytase can be extracted from plants or plant organs by methods known in the art. Accordingly it is also an object of the
present invention to provide a process for the production of the phytases of the present invention in plants or plant organs, like
seeds, the phytases when produced by such methods, and the transformed plants and plant organs, like seeds, themselves.
[0027] Once obtained the polypeptides of the present invention can be characterized regarding their properties which
make them useful in agriculture; any assay known in the art and described e.g. by Simons et al. [Br. J. Nutr. 64, 525-540
(1990)], Schöner et al. [J. Anim. Physiol. a. Anim. Nutr. 66, 248-255 (1991)], Vogt [Arch. Geflügelk. 56, 93-98 (1992)],
Jongbloed et al. [J. Anim. Sci. 70, 1159-1168 (1992)], [J. Anim. Physiol. a. Anim. Nutr. 69, 278-283 (1993)],
Broz et al. [Br. Poultry Sci. 35, 273-280 (1994)] and Düngelhoef et al. [Animal Feed Sci. Technol. 49, 1-10 (1994)]
can be used.
[0028] In general the polypeptides of the present invention can be used without being limited to a specific field of
application for the conversion of inositol polyphosphates, like phytate, to inositol and inorganic phosphate.
[0029] Furthermore the polypeptides of the present invention can be used in a process for the preparation of
compound food or feeds wherein the components of such a composition are mixed with one or more polypeptides of the
present invention. Accordingly compound food or feeds comprising one or more polypeptides of the present invention
are also an object of the present invention. A man skilled in the art is familiar with their process of preparation. Such
compound foods or feeds can further comprise additives or components generally used for such purpose and known in
the state of the art.

[0030] It is furthermore an object of the present invention to provide a process for the reduction of levels of phytate in
animal manure characterized in that an animal is fed such a feed composition in an amount effective in converting
phytate contained in the feedstuff to inositol and inorganic phosphate.

[0031] Before describing the present invention in more detail a short explanation of the enclosed Figures is given
below.



Figure Legends

[0032]

Figure 1: Primary sequence alignment of A. niger (ficuum), A. terreus cbs116.46 and A. fumigatus [ATCC 13073]
phytase. Stars show identical residues within the active site and rectangles non-identical residues
within the active site.
Figure 2: pH optima curves. Specific activity of wild-type and mutant A. fumigatus phytases is plotted against pH
of incubation. Filled squares represent A. fumigatus wild-type phytase; Open triangles represent A.
fumigatus Q27L mutant; Filled circles represent A. fumigatus Q27L, Q274L mutant; Open squares
represent A. fumigatus Q27L, Q274L, G277D mutant.



Figure 3: Substrate specificities of wild-type and mutant A. fumigatus phytases. (A) wild-type; (B) Q27L single
mutant; (C) Q27L, Q274L, G277D triple mutant. The following substrates were used: (1) phytic acid; (2)
p-nitrophenyl phosphate; (3) fructose-1,6-bisphosphate; (4) fructose-6-phosphate; (5) glucose-6-phosphate;
(6) ribose-5-phosphate; (7) α-glycerophosphate; (8) β-glycerophosphate; (9) 3-phosphoglycerate;
(10) phosphoenolpyruvate; (11) AMP; (12) ADP; (13) ATP.
Figure 4: Complete coding sequence and encoded amino acid sequence of the Aspergillus nidulans phytase.

Figure 5: Complete coding sequence and encoded amino acid sequence of Talaromyces thermophilus phytase.

Figure 6: Complete coding sequence and encoded amino acid sequence of Aspergillus fumigatus [ATCC 13073]
phytase.

Figure 7: Complete coding sequence and encoded amino acid sequence of Aspergillus terreus CBS 116.46
phytase.






Figure 8: Crystallographic data of the structure of the Aspergillus niger phytase.

Figure 9: Substrate specificities of wild-type and mutant A. fumigatus phytase (N1-N6). Substrates 1 to 13 are as
indicated for Figure 3.

Figure 10: pH optima curves of further mutant A. fumigatus phytases (N1-N6). All activity values were standardized
(maximum activity = 1.0).

Figure 11a: Stereo picture of the three-dimensional fold of A. niger (A. ficuum; NRRL 3135) phytase. The active site
is indicated with a circle and the catalytically essential amino acid residues Arg 58 and His 59 are shown
in ball-and-stick representation. This figure was prepared with the programs "MOLSCRIPT" [Kraulis,
P.J., J. Appl. Cryst. 24, 946-950 (1991)] and "RASTER3D" [Merritt, E.A. & Murphy, M.E.P., Acta Cryst.,
869-873 (1994)].



Figure 11b: Topological sketch, using the same scheme as in (a). The five disulphide bridges are shown as black
zigzag lines together with the sequence numbers of the cysteine residues involved. The β-strands are
defined with the sequence numbers A: 48-58, B: 134-138, C: 173-177, D: 332-337, E: 383-391, and F:
398-403. The α-helices are defined with the sequence numbers a: 66-82, b: 88-95, c: 107-123, d: 141-159,
e: 193-197, f: 200-210, g: 213-223, h: 231-246, i: 257-261, j: 264-281, k: 290-305, l: 339-348, m:
423-429, and n: 439-443. The asterisk at the C-terminal end of β-strand A marks the location of the
catalytically essential amino acid residues Arg 58 and His 59.

Figure 12: Stereo picture of the active site of A. ficuum (ATCC 13073) phytase with a hypothetical binding mode of
the substrate phytate. In this model, the bound crystal water molecules were removed and the protein
atom positions were held fixed, except for small adaptations of the side chain torsion angles of Lys 68
in order to interact with the substrate. All the conserved amino acid residues Arg 58, His 59, Arg 62, Arg
142, His 338 and Asp 339 form hydrogen bonds to the scissile 3-phosphate group of phytate, as
indicated with lines of small dots. His 59 is in a favorable position to make a nucleophilic attack at the
scissile phosphorus, indicated with a line of larger dots, and Asp 339 is in a position to protonate the
leaving group.

Figure 13: Construction of the basic plasmids pUC18-AfumgDNA and pUC18-AfumcDNA for site directed
mutagenesis.

Figure 14a: Primer sets A-N used for site directed mutagenesis.

Figure 14b: Primer sets O-T used for site directed mutagenesis.

Figure 15: Construction of plasmids pgDNAT1-pgDNAT7.

Figure 16: Construction of plasmids pgDNAN1-pgDNAN6.

Figure 17a: Construction of plasmids pcT1 - pcT7.

Figure 17b: Construction of plasmids pcT1-AvrII, pcT1-S66D and pcT1-S140Y-D141G.

Figure 17c: Construction of plasmids pcDNA-N27, -T27, -I27, -V27, -A27, -G27.
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H gure 18 : Construction of plasmids pcNI - pcN6. 

Figure 1 9 : Plasmid pAfum-TI for the expression of mutein T1 in Aspergillus niger. 

Figure 20 : pH optima curves. Specific activity of wild-type and mutant A fumigatus phytases is plotted against pH 
of incubation. Open triangles: A fumigatus [ATCC 13073] wild-type phytase; Open rhomt^s: A. 
fumigatus Q27G phytase; Filled squares: A fumigatus Q27N phytase; Filled triangles: A fumigatus 
Q27V phytase; Open squares: A fumigatus Q27A phytase; Filled circles: A fumigatus Q27I phytase; 
Open circles: A fumigatus Q27T phytase; Dashed line: A fumigatus Q27L phyt^a 

Figure 21: Substrate specificities of wild-type and mutant A. fumigatus [ATCC 13073] phytases. The substrates
1-13 used are the same as mentioned in Figure 3. The specific activities of the different phytases with
any one of the 13 substrates tested are given in the following order (from left to right): A. fumigatus
wild-type phytase, A. fumigatus Q27N phytase, A. fumigatus Q27T phytase, A. fumigatus Q27L phytase, A.
fumigatus Q27I phytase, A. fumigatus Q27V phytase, A. fumigatus Q27A phytase, A. fumigatus Q27G
phytase.

Figure 22: pH optima curves. Specific activity of wild-type and mutant A. fumigatus [ATCC 13073] phytases is plotted
against pH of incubation. Filled rhombs: A. fumigatus wild-type phytase; Filled squares: A.
fumigatus Q27L single mutant; Open circles: A. fumigatus Q27L-S66D double mutant; Filled triangles:
A. fumigatus Q27L-S140Y-D141G triple mutant.

Figure 23: Natural variation of phytases in different isolates of A. fumigatus [ATCC 13073]. The predicted protein
sequences are shown and compared to that of the phytase from A. fumigatus strain ATCC 13073. Only
the amino acids which differ from those in #13073 are shown.

Figure 24: pH dependent specific activity of phytases isolated from two different A. fumigatus wildtype strains.
Open squares: wild-type strain ATCC 13073; Filled circles: strain ATCC 32239.

Figure 25: Substrate specificities of phytases isolated from two different A. fumigatus wildtype strains. Black bars:
wild-type strain ATCC 13073; White bars: strain ATCC 32239.

Figure 26: Construction of plasmids pc-S130N, pc-R129L-S130N, pc-K167G-R168Q.

Examples 

Example 1

Homology modelling of A. fumigatus and A. terreus cbs116.46 phytase

[0033] The amino acid sequences of A. fumigatus [ATCC 13073] (see Figure 1) and A. terreus cbs116.46 phytase
(see Figure 1) were compared with the sequence of A. niger (ficuum) phytase (see Figure 1) for which the
three-dimensional structure had been determined by X-ray crystallography. Crystallographic data are given in Figure 8.
[0034] A multiple amino acid sequence alignment of A. niger (ficuum) phytase, A. fumigatus phytase and A. terreus
cbs116.46 phytase was calculated with the program "PILEUP" (Prog. Menu for the Wisconsin Package, version 8,
September 1994, Genetics Computer Group, 575 Science Drive, Madison, Wisconsin, USA 53711). The three-dimensional
models of A. fumigatus phytase and A. terreus cbs116.46 phytase were built by using the structure of A. niger (ficuum)
phytase as template and exchanging the amino acids of A. niger (ficuum) phytase according to the sequence alignment
to amino acids of A. fumigatus and A. terreus cbs116.46 phytases, respectively. Model construction and energy
optimization were performed by using the program Moloc (Gerber and Müller, 1995). C-alpha positions were kept fixed except
for new insertions/deletions and in loop regions distant from the active site.

[0035] Only small differences of the modelled structures to the original crystal structure could be observed in external
loops. Furthermore, the different substrate molecules that mainly occur on the degradation pathway of phytic acid
(myo-inositol-hexakisphosphate) by Pseudomonas sp. bacterium phytase and, as far as determined, by A. niger (ficuum)
phytase (Cosgrove, 1980; Fig. 1) were constructed and forged into the active site cavity of each phytase structure. Each
of these substrates was oriented in a hypothetical binding mode proposed for histidine acid phosphatases (Van Etten,
1982). The scissile phosphate group was oriented towards the catalytically essential His 59 to form the covalent
phosphoenzyme intermediate. The oxygen of the substrate phosphoester bond which will be protonated by Asp 339 after
cleavage was orientated towards the proton donor. Conformational relaxation of the remaining structural part of the
substrates as well as the surrounding active site residues was performed by energy optimization with the program
Moloc.

[0036] Based on the structure models the residues pointing into the active site cavity were identified. More than half
(60%) of these positions were identical between these three phytases, whereas only few positions were not conserved
(see Figure 1). This observation could be extended to four additional phytase sequences (A. nidulans, A. terreus 9A1,
Talaromyces thermophilus, Myceliophthora thermophila).

[0037] The results coming from sequence alignment and structural information including favourable enzyme-substrate
interactions were combined to define the positions for mutational analysis which are shown in Table 1.
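
The comparison of active-site positions described in [0036] amounts to counting the alignment columns in which all sequences carry the same residue. The following is a minimal sketch of that count, assuming the sequences have already been aligned to equal length. The function name `conserved_fraction`, the toy fragments and the column indices are illustrative assumptions and are not taken from Figure 1.

```python
def conserved_fraction(aligned_seqs, columns):
    """Fraction of the given alignment columns where every sequence has the same residue.

    aligned_seqs: equal-length strings, '-' marks a gap.
    columns: 0-based column indices to inspect (e.g. positions lining the cavity).
    """
    if not columns:
        raise ValueError("no columns given")
    conserved = 0
    for col in columns:
        residues = {seq[col] for seq in aligned_seqs}
        # A column counts as conserved only if it holds one residue type and no gap.
        if len(residues) == 1 and "-" not in residues:
            conserved += 1
    return conserved / len(columns)


# Toy, made-up aligned fragments (NOT real phytase sequences).
toy_alignment = [
    "RHGARYPT",
    "RHGERYPS",
    "RHGARFPT",
]
cavity_columns = [0, 1, 3, 5, 7]  # hypothetical cavity positions

toy_fraction = conserved_fraction(toy_alignment, cavity_columns)
print(f"{toy_fraction:.0%} of the inspected columns are identical")
```

With real data, the column list would come from the residues identified as pointing into the active site cavity, and the result would be compared with the figure of 60% stated above.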

References: 

[0038] 

Gerber, P. and Müller, K. (1995) Moloc molecular modeling software. J. Comput. Aided Mol. Des. 9, 251-268

Van Etten, R.L. (1982) Human prostatic acid phosphatase: a histidine phosphatase. Ann. NY Acad. Sci. 390, 27-50

Cosgrove, D.J. (1980) Inositol phosphates - their chemistry, biochemistry and physiology: studies in organic
chemistry, chapter 4. Elsevier Scientific Publishing Company, Amsterdam, Oxford, New York.

Example 2

Construction of plasmids pUC18-AfumgDNA and pUC18-AfumcDNA

[0039] Plasmids pUC18-AfumgDNA and pUC18-AfumcDNA, the basic constructs for all the A. fumigatus muteins
described below, were constructed as follows.

[0040] pUC18-AfumgDNA: The genomic DNA sequence of the phytase gene of Aspergillus fumigatus was obtained
by PCR using the "Expand™ High Fidelity PCR Kit" (Boehringer Mannheim, Mannheim, Germany) with primers #39 and
#40 (designed on the basis of the genomic sequence shown in Figure 6) and genomic DNA of Aspergillus fumigatus
[ATCC 13073] from the A. fumigatus (NIH stock 5233) genomic library in a Lambda FixII vector (Stratagene, La Jolla,
CA 92037, USA; catalog No. 946055).

Primer #39:
        BspHI
5' TAT ATC ATG ATT ACT CTG ACT TTC CTG CTT TCG 3'
            M   I   T   L   T   F   L   L   S

Primer #40:
                           EcoRV
3' CCT CTC ACG AAA TCA ACT CTA TAG ATA TAT 5'
    G   E   C   F   S   *
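
The primer annotations above can be checked by translating the primers and locating the restriction sites. The sketch below assumes primer #39 reads 5' TAT ATC ATG ATT ACT ... 3' (so that it translates to the printed peptide M I T L T F L L S) and that primer #40 is written 3' to 5' as shown. The helper names are illustrative, and the recognition sequences used are the standard ones for BspHI (TCATGA) and EcoRV (GATATC).

```python
# Standard genetic code, built from the usual TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons of a 5'->3' DNA string; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


def complement(dna):
    """Base-wise complement without reversing the string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))


primer39 = "TATATCATGATTACTCTGACTTTCCTGCTTTCG"   # written 5'->3' (assumed reading)
primer40_3to5 = "CCTCTCACGAAATCAACTCTATAGATATAT"  # written 3'->5' as in the text

# Complementing the 3'->5' primer gives the coding strand in 5'->3' direction.
coding_end = complement(primer40_3to5)

start = primer39.find("ATG")
print("BspHI site at index", primer39.find("TCATGA"), "overlapping ATG at index", start)
print("N-terminal peptide:", translate(primer39[start:]))
print("C-terminal peptide:", translate(coding_end)[:6])
print("EcoRV site at index", coding_end.find("GATATC"), "after TGA at index", coding_end.find("TGA"))
```

The second codon after ATG is ATT (isoleucine), and the EcoRV site lies directly downstream of the TGA stop codon.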



[0041] The reaction mix included 10 pmol of each primer and 200 ng of template DNA. 35 rounds of amplification
were done with the following cycling values: 95 °C, 1 min/56 °C, 1 min/72 °C, 90 sec. The PCR-amplified Aspergillus
fumigatus mutein genes had a new BspHI site at the ATG start codon, introduced with primer #39, which resulted in the
change of the second amino acid from a valine to an isoleucine. Furthermore, an EcoRV site was created with primer
#40 downstream of the TGA termination codon of the gene.

[0042] The PCR fragment (approx. 1450 bp) was subsequently cloned into the SmaI site of pUC18 using the "sure
clone Kit" (Boehringer Mannheim s.a.) according to the supplier's recommendations. The resulting plasmid was named
pUC18-AfumgDNA.

[0043] pUC18-AfumcDNA: This plasmid lacks the intron (small gap letters in Figure 6) of the A. fumigatus phytase
gene and was constructed as outlined in Figure 13. Briefly, using primers Fum28 and Fum11 the 5' end of exon 2 was
amplified by PCR (see below), digested with NcoI and EagI (new restriction site introduced with primer Fum28) and
ligated together with the linker coding for exon 1 made of primers Fum26 and Fum27 into the XbaI and NcoI sites of
pUC18-AfumgDNA, thereby resulting in plasmid pUC18-AfumcDNA.

Fum28:

5' ATATAT£fifi£££AGTGTCTGCGGCACCTAGT 3' 
EagI 

Fum11:

5' TGAGGTCATCCGCACCCAGAG 3'

Fum26:

5' CTAGAATTCATGGTGACTCTGACTTTCCTGCTTTCGGCGGCGTATCTGCTTTCC 3'

Fum27:

5' GGCCGGAAAGCAGATACGCCGCCGAAAGCAGGAAAGTCAGAGTCACCATGAATT 3'
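
The linker made of Fum26 and Fum27 can be checked for complementarity and for the single-stranded ends it leaves after annealing. The sketch below assumes the two oligonucleotides as listed above and a 4-nucleotide 5' overhang on each strand. The helper name `revcomp` and the variable names are illustrative.

```python
def revcomp(dna):
    """Reverse complement of a DNA string written 5'->3'."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


fum26 = "CTAGAATTCATGGTGACTCTGACTTTCCTGCTTTCGGCGGCGTATCTGCTTTCC"
fum27 = "GGCCGGAAAGCAGATACGCCGCCGAAAGCAGGAAAGTCAGAGTCACCATGAATT"

OVERHANG = 4  # assumed length of each 5' single-stranded end

# The double-stranded core is each oligo without its own 5' overhang.
core_top = fum26[OVERHANG:]
core_bottom = fum27[OVERHANG:]
cores_pair = revcomp(core_top) == core_bottom

print("cores are fully complementary:", cores_pair)
print("5' overhang of Fum26:", fum26[:OVERHANG])  # CTAG, compatible with an XbaI-cut end
print("5' overhang of Fum27:", fum27[:OVERHANG])  # GGCC, compatible with an EagI-cut end
```

The CTAG end matches the XbaI site of the vector and the GGCC end matches the EagI site introduced with primer Fum28, which is consistent with the ligation described in [0043].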

PCR reaction to get 5' end of exon 2 of the A. fumigatus phytase:

2 µl     template: pUC18-AfumgDNA (20 ng)
1 µl     dNTP's-mix (Boehringer Mannheim s.a.)
5 µl     10x Buffer
1 µl     Taq polymerase (Boehringer Mannheim s.a.)
1.9 µl   Fum11 (= 10 pmol)
2 µl     Fum28 (= 10 pmol)
37.1 µl  H2O
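
As a quick plausibility check of the pipetting scheme, the volumes can be summed and the final buffer and primer concentrations derived. This is only an arithmetic sketch using the volumes listed above; the dictionary keys and variable names are illustrative.

```python
# Volumes in microlitres, as listed in the reaction scheme.
reaction_ul = {
    "template pUC18-AfumgDNA (20 ng)": 2.0,
    "dNTP mix": 1.0,
    "10x buffer": 5.0,
    "Taq polymerase": 1.0,
    "Fum11 (10 pmol)": 1.9,
    "Fum28 (10 pmol)": 2.0,
    "H2O": 37.1,
}

total_ul = round(sum(reaction_ul.values()), 1)

# A 10x stock diluted into the total volume gives the working strength.
buffer_final_x = 10 * reaction_ul["10x buffer"] / total_ul

# 1 pmol per microlitre equals 1 micromolar.
primer_final_uM = 10 / total_ul

print(f"total volume: {total_ul} µl")
print(f"buffer: {buffer_final_x:.1f}x, each primer: {primer_final_uM:.1f} µM")
```

The components add up to 50 µl, giving 1x buffer and 0.2 µM of each primer.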

[0044] In total 35 cycles with the temperature profile: 95 °C for 30 sec/56 °C for 30 sec/72 °C for 45 sec were made.
The amplified fragment (approx. 330 bp) was extracted once with an equal volume of phenol/chloroform (1:1). To the
recovered aqueous phase 0.1 volume of 3 M sodium acetate, pH 4.8 and 2.5 volumes of ethanol were added. The mixture
was centrifuged for 10 min at 12000 g and the pellet resuspended in 20 µl of H2O. Subsequently, the purified fragment
was digested with NcoI and EagI and processed as outlined above.

Example 3 

Construction of muteins of the phytase of Aspergillus fumigatus for expression in A. niger

[0045] To construct all muteins for the expression in A. niger, plasmid pUC18-AfumgDNA was used as template for
site-directed mutagenesis. Mutations were introduced using the "quick exchange™ site-directed mutagenesis kit" from
Stratagene (La Jolla, CA, USA) following the manufacturer's protocol and using the corresponding primers (Figure 14).
All mutations made are summarized in Table 1 A and B, wherein T1 to T7 and N1 to N6, respectively, refer to the muteins
and "Mutation" to the amino acids replaced at such position. For example, T5 refers to a mutein with a double mutation:
L at position 27 for Q and L at position 274 for Q. The primer sets (A-H) used to introduce the corresponding mutations
are shown in Figure 14a. The newly introduced amino acid is shown in bold and the subscript indicates the position in
the mature Aspergillus fumigatus enzyme according to the numbering of the A. niger amino acid sequence. Figures
15 and 16 outline the scheme for the construction of different plasmids pgT1-pgT7 and pgN1-pgN6 encoding the
muteins carrying only one mutation (T1-T4; N1-N3) or more mutations (T5-T7; N4-N6). Clones harboring the desired
mutations were identified by DNA sequence analysis as known in the art. The mutated phytases were verified by complete
sequencing of the genes.
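
The substitution notation used here (for example Q27L: Q at position 27 replaced by L) can be handled mechanically. The following is a minimal sketch, assuming one-letter residue codes and positions in A. niger numbering. The names `Mutation`, `parse_mutation` and `apply_mutations` are illustrative, and the wild-type table holds only the two positions needed for mutein T5 as described above.

```python
import re
from typing import Dict, List, NamedTuple


class Mutation(NamedTuple):
    wild_type: str    # residue that is replaced (one-letter code)
    position: int     # position, A. niger numbering
    replacement: str  # newly introduced residue


def parse_mutation(code: str) -> Mutation:
    """Split a code such as 'Q27L' into wild-type residue, position and replacement."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", code)
    if match is None:
        raise ValueError(f"not a substitution code: {code!r}")
    return Mutation(match.group(1), int(match.group(2)), match.group(3))


def apply_mutations(residues: Dict[int, str], codes: List[str]) -> Dict[int, str]:
    """Return a copy of the position->residue table with the substitutions applied."""
    mutated = dict(residues)
    for code in codes:
        mutation = parse_mutation(code)
        # Refuse to mutate if the table does not hold the expected wild-type residue.
        if mutated.get(mutation.position) != mutation.wild_type:
            raise ValueError(f"{code}: position {mutation.position} is not {mutation.wild_type}")
        mutated[mutation.position] = mutation.replacement
    return mutated


# T1 and T5 as described in the text; the table is a partial, illustrative view.
MUTEINS = {"T1": ["Q27L"], "T5": ["Q27L", "Q274L"]}
wild_type_positions = {27: "Q", 274: "Q"}

t5 = apply_mutations(wild_type_positions, MUTEINS["T5"])
print(t5)
```

The wild-type check guards against applying a substitution to the wrong numbering scheme, which matters here because positions refer to the A. niger sequence rather than to the A. fumigatus sequence itself.
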
Example 4

Construction of muteins of the phytase of Aspergillus fumigatus for expression in Saccharomyces cerevisiae

[0046] Construction of plasmids pcT1 - pcT7 (Figure 17a) and pcN1 - pcN6 (Figure 18), respectively, encoding the
muteins T1-T7 and N1-N6 for the expression in S. cerevisiae was basically done as outlined in Example 3. Instead of
using pUC18-AfumgDNA as the basic construct to introduce the mutations, plasmid pUC18-AfumcDNA was used (Figure
13).

[0047] The plasmids pcDNA-N27, -G27, -V27, -A27, -I27 and -T27 encoding the muteins N27, G27, V27, A27, I27
and T27 were constructed as follows:

[0048] A silent restriction site for AvrII was introduced into plasmid pcT1 by site directed mutagenesis as described in
Example 3 using primer set I (Figure 14a; Figure 17b). The A. fumigatus phytase gene fragment AvrII/XhoI was then
replaced by the linker fragment harbouring the desired mutations (Figure 17c). Each linker fragment was generated by
annealing of the respective pairs of synthesized polynucleotides (Fig. 14b; sense and antisense strand; 90 ng each) for
3 min at 70 °C in 9 µl distilled water.

[0049] Construction of plasmids pcT1-S66D and pcT1-S140Y-D141G encoding the A. fumigatus Q27L-S66D double
mutant and the A. fumigatus Q27L-S140Y-D141G triple mutant was basically carried out as described in Example 3.
Plasmid pcT1, harbouring the mutation coding for Q27L, was used as template for site directed mutagenesis together
with the corresponding primer sets J and K (Figure 14a; Figure 17b).

[0050] All mutations were verified by DNA sequence analysis of the entire gene.

Example 5 

Expression in Aspergillus niger

[0051] The genes encoding the aforementioned A. fumigatus wild-type phytase and muteins (Fig. 16) were isolated
with BspHI and EcoRV from plasmids pgDNAT1-pgDNAT7 and pgDNAN1-pgDNAN6 and ligated into the NcoI site
downstream of the glucoamylase promoter of Aspergillus niger (glaA) and the EcoRV site upstream of the Aspergillus
nidulans tryptophan C terminator (trpC) (Mullaney et al., 1985). The resulting expression plasmids had in addition the
orotidine-5'-phosphate decarboxylase gene (pyr4) of Neurospora crassa as selection marker. Figure 19 shows an
example for such an expression plasmid carrying the gene encoding mutein T1 (van den Hondel et al., 1991). The basic
expression plasmid described above corresponds basically to the pGLAC vector described in example 9 of EP 684 313.
Transformation of Aspergillus niger and expression of the muteins was done as described in EP 684 313.
[0052] The supernatant was concentrated by way of ultrafiltration in Amicon 8400 cells (PM30 membranes) and
ultrafree-15 centrifugal filter devices (Biomax-30K, Millipore).

[0053] The concentrate (typically 1.5-5 ml) was desalted in aliquots of 1.5 ml on a Fast Desalting HR 10/10 column
(Pharmacia Biotech), with 10 mM sodium acetate, pH 5.0, serving as elution buffer. The desalted A. fumigatus samples
were directly loaded onto a 1.7 ml Poros HS/M cation exchange chromatography column (PerSeptive Biosystems,
Framingham, MA, USA). A. terreus cbs116.46 [CBS 220.95] phytase was directly loaded onto a 1.7 ml Poros HQ/M
anion exchange chromatography column. In both cases, phytase was eluted in pure form by way of a sodium chloride
gradient.

References: 

[0054]

Mullaney, E. J., J. E. Hamer, K. A. Roberti, M. M. Yelton, and W. E. Timberlake. 1985. Primary structure of the trpC
gene from Aspergillus nidulans. Mol. Gen. Genet. 199:37-45.

Van den Hondel, C. A. M. J. J., P. J. Punt, and R. F. M. van Gorcom. 1991. Heterologous gene expression in
filamentous fungi. In: More gene manipulations in fungi, pp. 396-428. Bennett, J. W. and Lasure, L. L. (eds.).
Academic Press Inc., San Diego, CA.

Example 6 

Expression in Saccharomyces cerevisiae

[0055] The intron less genes encoding the A. fumigatus wild-type phytase and the different muteins (Fig. 17/18)
mentioned above were isolated from the respective plasmids pUC18-AfumcDNA, pcDNAT1 - pcDNAT7 and pcDNAN1 -
pcDNAN6 with EcoRI and EcoRV and subcloned either between the blunt ended XhoI and the EcoRI sites of plasmid
pYES2 (Invitrogen, San Diego, CA, USA) or the shortened GAPFL (glyceraldehyde-3-phosphate dehydrogenase)
promoter and the PHO5 terminator as described by Janes et al. (1990). Transformation of Saccharomyces cerevisiae
strains, e.g. INVSc1 (Invitrogen, San Diego, CA, USA) was done according to Hinnen et al. (1978). Single colonies
harbouring the phytase gene under the control of the GAPFL promoter were picked and cultivated in 5 ml selection medium
(SD -uracil) (Sherman et al., 1986) at 30 °C under vigorous shaking (250 rpm) for 1 day. The preculture was then added
to 500 ml YPD medium (Sherman et al., 1986) and cultivated under the same conditions. After four days cell broth was
centrifuged (7000 rpm, GS3 rotor, 15 min, 5 °C) and the supernatant was collected. Induction of the GAL1 promoter
(plasmid pYES2 from Invitrogen, San Diego, CA, USA) was done according to the manufacturer's instructions.
Purification of the muteins was as described in example 5 (s.a.).

References: 
[0056]

Janes, M., B. Meyhack, W. Zimmermann and A. Hinnen. 1990. The influence of GAP promoter variants on hirudin
production, average plasmid copy number and cell growth in Saccharomyces cerevisiae. Curr. Genet. 18: 97-103

Hinnen, A., J.B. Hicks and G.R. Fink. 1978. Proc. Natl. Acad. Sci. USA 75: 1929-1933

Sherman, J.R., Fink, G.R. and Hicks, J.B. (1986). Laboratory Course Manual for Methods in Yeast Genetics. Cold
Spring Harbor University Press.

Example 7 

Determination of phytase activity and substrate specificity

[0057] Phytase activity was measured in an assay mixture containing 0.5% phytic acid (~5 mM), 200 mM sodium
acetate, pH 5.0. After 15 min incubation at 37 °C, the reaction was stopped by addition of an equal volume of 15%
trichloroacetic acid. The liberated phosphate ions were quantified by mixing 100 µl of the assay mixture with 900 µl H2O and
1 ml of 0.6 M H2SO4, 2% ascorbic acid and 0.5% ammonium molybdate. Standard solutions of potassium phosphate
were used as reference.
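
When a phosphate reading from the colour reaction is converted back to the concentration in the incubation, the dilution steps of this protocol have to be taken into account. The sketch below only works out those dilution factors from the volumes stated above; the function name `dilution_factor` is illustrative, and the conversion from absorbance to phosphate via the standard solutions is not shown.

```python
def dilution_factor(sample_volume, *added_volumes):
    """Factor by which a sample is diluted when the given volumes are added to it."""
    return (sample_volume + sum(added_volumes)) / sample_volume


# Stopping the reaction: one volume of assay mixture plus one volume of 15% TCA.
stop_dilution = dilution_factor(1.0, 1.0)

# Colour reaction: 100 µl stopped mixture + 900 µl H2O + 1000 µl molybdate reagent.
colour_dilution = dilution_factor(100.0, 900.0, 1000.0)

overall_dilution = stop_dilution * colour_dilution
print(f"stop step: {stop_dilution:g}x, colour step: {colour_dilution:g}x, overall: {overall_dilution:g}x")
```

Under these assumptions a phosphate concentration measured in the final 2 ml corresponds to a 40-fold higher concentration in the original incubation.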

[0058] In case of pH optimum curves, purified enzymes were diluted in 10 mM sodium acetate, pH 5.0. Incubations
were started by mixing aliquots of the diluted protein with an equal volume of 1% phytic acid (~10 mM) in a series of
different buffers: 0.4 M glycine/HCl, pH 2.5; 0.4 M acetate/NaOH, pH 3.0, 3.5, 4.0, 4.5, 5.0, 5.5; 0.4 M imidazole/HCl,
pH 6.0, 6.5; 0.4 M Tris/HCl, pH 7.0, 7.5, 8.0, 8.5, 9.0. Control experiments showed that pH was only slightly affected by
the mixing step. Incubations were performed for 15 min at 37 °C as described above.
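
The buffer series can be written down as a small table of incubation conditions. The sketch below lists every buffer and pH value named above and halves the stock concentrations to account for mixing with an equal volume of diluted enzyme. The structure and names are illustrative, and the small amount of sodium acetate carried over from the enzyme diluent is ignored.

```python
# Buffer systems and pH values of the substrate solutions, as listed in the text.
BUFFER_SERIES = [
    ("glycine/HCl", [2.5]),
    ("acetate/NaOH", [3.0, 3.5, 4.0, 4.5, 5.0, 5.5]),
    ("imidazole/HCl", [6.0, 6.5]),
    ("Tris/HCl", [7.0, 7.5, 8.0, 8.5, 9.0]),
]
STOCK_BUFFER_M = 0.4             # buffer concentration of the substrate solution
STOCK_PHYTIC_ACID_PERCENT = 1.0  # phytic acid in the substrate solution


def incubation_conditions():
    """One entry per pH point, with concentrations after 1:1 mixing with enzyme."""
    conditions = []
    for buffer_name, ph_values in BUFFER_SERIES:
        for ph in ph_values:
            conditions.append({
                "buffer": buffer_name,
                "pH": ph,
                "buffer_M": STOCK_BUFFER_M / 2,
                "phytic_acid_percent": STOCK_PHYTIC_ACID_PERCENT / 2,
            })
    return conditions


conditions = incubation_conditions()
print(len(conditions), "pH points from", conditions[0]["pH"], "to", conditions[-1]["pH"])
```

After mixing, each incubation contains 0.5% phytic acid, the same substrate concentration as in the standard assay.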

[0059] For determination of the substrate specificities of wild-type and mutant A. fumigatus phytases, phytic acid in
the assay mixture was replaced by 5 mM concentrations of the respective phosphate compounds. The activity tests
were performed as described above.

[0060] Protein concentrations were calculated from the OD at 280 nm, using theoretical absorption values calculated
from the known protein sequences with the DNA* software (DNASTAR, Inc., Madison, Wisconsin, USA). An absorption
of 1.0 OD at 280 nm corresponds to 0.94 mg/ml A. fumigatus phytase and 0.85 mg/ml of A. terreus cbs116.46 phytase.
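
The conversion from OD at 280 nm to protein concentration is a simple proportionality. The following sketch uses the two conversion factors stated above; the function name and the optional `dilution` parameter are illustrative assumptions.

```python
# mg/ml of protein corresponding to 1.0 OD at 280 nm, as stated in the text.
MG_PER_ML_PER_OD280 = {
    "A. fumigatus": 0.94,
    "A. terreus cbs116.46": 0.85,
}


def protein_mg_per_ml(od280, phytase, dilution=1.0):
    """Protein concentration of the undiluted sample from a (possibly diluted) OD reading."""
    if od280 < 0:
        raise ValueError("OD must not be negative")
    return od280 * dilution * MG_PER_ML_PER_OD280[phytase]


print(f"{protein_mg_per_ml(0.5, 'A. fumigatus'):.2f} mg/ml")
print(f"{protein_mg_per_ml(0.2, 'A. terreus cbs116.46', dilution=10):.2f} mg/ml")
```

The sketch assumes the reading lies in the linear range of the photometer, which the text does not specify.
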
[0061] pH profiles of Aspergillus fumigatus mutants T1 (Q27L), T5 (Q27L, Q274L) and T6 (Q27L, Q274L, G277D)
have drastically changed compared to the wild-type A. fumigatus phytase (see Figure 2). All mutants showed equal pH
profiles. Increase in specific activity at pH 5.0 of the muteins as compared to the wild-type phytase of Aspergillus
fumigatus is shown in Table 2. Enzyme activities were measured under standard assay conditions at pH 5.0. Several
individual measurements (n: number of assays) were averaged.
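
The increase in specific activity can be expressed as a ratio to the wild-type value. The sketch below uses the mean specific activities given in Table 2 for the A. fumigatus enzymes; the function name `fold_increase` and the dictionary keys are illustrative, and the standard deviations are not propagated.

```python
# Mean specific activities in U/mg at pH 5.0, taken from Table 2.
SPECIFIC_ACTIVITY_U_PER_MG = {
    "wild type": 26.5,
    "Q27L": 83.4,
    "Q27L, Q274L": 88.7,
    "Q27L, Q274L, G277D": 92.3,
}


def fold_increase(activities, reference="wild type"):
    """Ratio of each specific activity to that of the reference enzyme."""
    base = activities[reference]
    return {name: value / base for name, value in activities.items() if name != reference}


fold = fold_increase(SPECIFIC_ACTIVITY_U_PER_MG)
for name, ratio in fold.items():
    print(f"{name}: {ratio:.2f}-fold")
```

On these figures, each of the three muteins shows a little more than three times the wild-type specific activity.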

[0062] The pH profile of A. fumigatus phytase mutant Q27A resembles the pH profile of A. fumigatus wild-type
phytase over nearly the whole pH range (Figure 20). Whereas the specific activity of wild-type phytase is decreasing at
pH values below pH 4.0, the specific activity of the phytase mutant Q27A remains nearly constant down to pH 2.9.
[0063] The single amino acid exchanges Q27L, Q27I, Q27V or Q27T have remarkably increased the specific activity
over the whole pH range, especially between pH 5.0 and 7.5 (Figure 20). Maximum values are reached at pH 6.5. In
addition, mutation Q27T caused the highest specific activity values for phytic acid at low pH (pH 3.0-5.0).
[0064] Higher specific activities are also gained by the single mutations Q27G or Q27N, between pH 2.5 and 7.0, with
maximum values at pH 6.0 (Figure 20). The specific activity decreases at pH values below 3.5.
[0065] All single mutants still show a broad substrate specificity which is comparable to that of A. fumigatus wild-type
phytase (Figure 21). Some of the mutants show significantly higher specific activities than other mutants for selected
substrates, e.g., the Q27T mutant for p-nitrophenyl phosphate and ATP, or the Q27G mutant for phosphoenolpyruvate.
[0066] As shown in Figure 22 the combination of mutation Q27L with S66D or S140Y and D141G led to a shift of the
pH profile towards lower pH. The maximum specific activity gained by the single mutation Q27L is further increased by
the additional amino acid exchanges.
[0067] As shown in Figure 3, Aspergillus fumigatus phytase mutant T1 (Q27L) showed no difference in substrate
specificity compared to the triple mutant T6 (Q27L, Q274L, G277D).

[0068] The pH profiles of the muteins N1-6, except N2, show significant differences compared to the wild-type phytase
(Fig. 10). Whereas the pH profile of mutein N4 is expanded towards lower pH, the profiles of muteins N3 to N6 are
shifted towards lower pH. The muteins N5, N6 reach maximum activity already at pH 3.0.
[0069] The muteins N1 to N6 show in almost all cases a drastic reduction in specific activity for all tested substrates,
except for phytic acid (Fig. 9). Specific activity for phytic acid remained unchanged compared to the wild-type phytase,
whereas mutants N3 and N6 show a tendentially higher activity (Fig. 19).



Table 1

A) Mutations towards A. terreus cbs116.46 phytase

Mutation   T1   T2   T3   T4   T5   T6   T7
Q27L       X                   X    X    X
Q274L           X              X    X    X
G277D                X              X    X
N340S                     X              X

B) Mutations towards A. niger (ficuum) phytase

Mutation   N1   N2   N3   N4   N5   N6
G277K      X              X    X    X
A205E           X         X         X
Y282H                X         X    X



Table 2

                                     U/mg            n
A. fumigatus wild-type phytase       26.5 ± 5.2      22
A. fumigatus Q27L                    83.4            4
A. fumigatus Q27L, Q274L             88.7 ± 13.5     8
A. fumigatus Q27L, Q274L, G277D      92.3 ± 12.0     9
A. terreus cbs116.46 phytase         195.8 ± 17.8    7



Table 3

Specific activity under standard assay conditions at pH 5.0. Average standard deviation is 10%.

                                   Specific activity [U/mg]   Number of independent assays
A. fumigatus wild-type phytase     26.5                       22
A. fumigatus Q27N                  45.5                       3
A. fumigatus Q27T                  106.9                      3
A. fumigatus Q27L                  83.4                       4
A. fumigatus Q27I                  91.2                       3
A. fumigatus Q27V                  35.0                       3
A. fumigatus Q27A                  27.3                       3
A. fumigatus Q27G                  59.6                       3
A. fumigatus Q27L-S66D             118.5                      3
A. fumigatus Q27L-S140Y-D141G      193.0                      3



Example 8 

[0070] As an alternative approach to obtain phytases with modified characteristics and to get a better idea about the
natural variation found in phytase characteristics within a certain species, naturally occurring variants of A. fumigatus
phytase were analysed. Phytase genes were obtained from six different isolates of A. fumigatus. The amino acid
sequence of phytase from two of the A. fumigatus isolates (ATCC 26934 and ATCC 34625) showed no difference to the
original amino acid sequence of wild-type A. fumigatus phytase ATCC 13073. Phytase from three other isolates had
one or two amino acid substitutions, none of which directly affected the active site. Enzymatic characteristics remained
unaffected by these substitutions (not shown). The phytase of the A. fumigatus isolate ATCC 32239 differed in 13
positions in the signal sequence and 51 positions in the mature part of the protein compared to the original wild-type A.
fumigatus phytase (ATCC 13073). Several of these substitutions affect variable amino acids of the active site cavity.
This resulted in an increase in specific activity with phytic acid as substrate (47 U/mg, standard enzyme assay) and in
loss of enzymatic activity above pH 7 (Fig. 24). Also in this case, the specific activity against phytic acid was increased
relative to the specific activities with other substrates (Fig. 25).

Example 9 

[0071] Construction of plasmids pc-S130N, pc-R129L-S130N, pc-K167G-R168Q encoding A. fumigatus [ATCC
13073] phytase S130N single mutant and R129L-S130N double mutant and A. nidulans phytase K167G-R168Q double
mutant was basically carried out as described in Example 3. Plasmid pUC18-AfumcDNA was used as template for site
directed mutagenesis together with the corresponding primer sets M and N (Figure 14a; Figure 26).
[0072] All mutations were verified by DNA sequence analysis of the entire gene.

Example 10

[0073] When expressed in A. niger and stored as concentrated culture supernatants at 4 °C, the phytases from A.
fumigatus and A. nidulans displayed a tendency to undergo proteolytic degradation. N-terminal sequencing of fragments
suggested that cleavage occurred between amino acids S130-V131 and K167-R168 or R168-A169, respectively.
Comparison with the 3D structure of A. niger phytase revealed that all cleavage sites are found within surface-exposed
loop structures and are therefore accessible to proteases.

[0074] Site-directed mutagenesis at protease-sensitive sites of A. fumigatus phytase (S130N, R129L-S130N) and A.
nidulans phytase (K167G-R168Q) yielded mutant proteins with considerably reduced susceptibility to proteolysis.
[0075] In contrast to expression in A. niger, proteolytic degradation was not observed when the phytases were
expressed in Hansenula polymorpha.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: F. Hoffmann-La Roche AG 

(B) STREET: Grenzacherstrasse 124 

(C) CITY: Basle 

(D) STATE: BS 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP): CH-4002

(G) TELEPHONE: 061-6882505 

(H) TELEFAX: 061-6881395 

(I) TELEX: 962292/965542 hlr ch 

(ii) TITLE OF INVENTION: Modified Phytases 
(iii) NUMBER OF SEQUENCES: 8 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA:

APPLICATION NUMBER: EP 98104858.0 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1931 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: join(158..204, 259..1600)
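
The CDS location joins two exons that are separated by an intron. As a small consistency check, the location string can be parsed and the spliced length compared with the length stated for SEQ ID NO: 2. This is a sketch only: the helper names are illustrative, and the parser handles just the plain `join(a..b, c..d)` form used here.

```python
import re


def parse_join(location):
    """Return (start, end) pairs, 1-based and inclusive, from a join(...) location."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Total number of nucleotides covered by the segments."""
    return sum(end - start + 1 for start, end in segments)


def splice(sequence, segments):
    """Concatenate the segments of a sequence (positions are 1-based, inclusive)."""
    return "".join(sequence[start - 1:end] for start, end in segments)


exons = parse_join("join(158..204, 259..1600)")
cds_nt = spliced_length(exons)
print(exons, cds_nt, "nt =", cds_nt // 3, "codons")
```

The two exons contribute 47 and 1342 nucleotides, that is 1389 nucleotides or 463 codons, which agrees with the 463 amino acids of SEQ ID NO: 2; the stop codon lies outside the stated location.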



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TCTGTAACCG ATAGCGGACC GACTAGGCAT CGTTGATCCA CAATATCTCA GACAATGCAA 
60 

CTCAGTCGAA TATGAAGGGC TACAGCCAGC ATTTAAATAC GGCCGTCTAG GTCG<3GCTCC 
120 

GGGGATGAGG AGGAGCAGGC TCGTGTTCAT TTCGGTC ATG GCT TTT TTC ACG GTC
175

Met Ala Phe Phe Thr Val
1 5

GCT CTT TCG CTT TAT TAC TTG CTA TCG AG GTGAGATCTC TACAATATCT
224 

Ala Leu Ser Leu Tyr Tyr Leu Leu Ser Arg 
10 15 

GTCTGCTTAG TTGAATTGGT ACTTATCTGT ACAG A GTC TCT GCT CAG GCC CCA 
277 

Val Ser Ala Gln Ala Pro
20

GTG GTC CAG AAT CAT TCA TGC AAT ACG GCG GAC GGT GGA TAT CAA TGC
325

Val Val Gln Asn His Ser Cys Asn Thr Ala Asp Gly Gly Tyr Gln Cys
25 30 35

TTC CCC AAT GTC TCT CAT GTT TGG GGT CAG TAC TCG CCG TAC TTC TCC
373

Phe Pro Asn Val Ser His Val Trp Gly Gln Tyr Ser Pro Tyr Phe Ser
40 45 50

ATC GAG CAG GAG TCA GCT ATC TCT GAG GAC GTG CCT CAT GGC TGT GAG
421

Ile Glu Gln Glu Ser Ala Ile Ser Glu Asp Val Pro His Gly Cys Glu
55 60 65 70

GTT ACC TTT GTG CAG GTG CTC TCG CGG CAT GGG GCT AGG TAT CCG ACA
469

Val Thr Phe Val Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr
75 80 85

GAG TCG AAG AGT AAG GCG TAC TCG GGG TTG ATT GAA GCA ATC CAG AAG
517

Glu Ser Lys Ser Lys Ala Tyr Ser Gly Leu Ile Glu Ala Ile Gln Lys
90 95 100

AAT GCT ACC TCT TTT TGG GGA CAG TAT GCT TTT CTG GAG AGT TAT AAC
565

Asn Ala Thr Ser Phe Trp Gly Gln Tyr Ala Phe Leu Glu Ser Tyr Asn
105 110 115

TAT ACC CTC GGC GCG GAT GAC TTG ACT ATC TTC GGC GAG AAC CAG ATG
613

Tyr Thr Leu Gly Ala Asp Asp Leu Thr Ile Phe Gly Glu Asn Gln Met
120 125 130

GTT GAT TCG GGT GCC AAG TTC TAC CGA CGG TAT AAG AAT CTC GCC AGG
661

Val Asp Ser Gly Ala Lys Phe Tyr Arg Arg Tyr Lys Asn Leu Ala Arg
135 140 145 150

AAA AAT ACT CCT TTT ATC CGT GCA TCA GGG TCT GAC CGT GTC GTT GCG
709

Lys Asn Thr Pro Phe Ile Arg Ala Ser Gly Ser Asp Arg Val Val Ala
155 160 165

TCT GCG GAG AAG TTC ATT AAT GGA TTT CGC AAG GCT CAG CTC CAC GAC
757

Ser Ala Glu Lys Phe Ile Asn Gly Phe Arg Lys Ala Gln Leu His Asp
170 175 180

CAT GGC TCC AAA CGT GCT ACG CCA GTT GTC AAT GTG ATT ATC CCT GAA
805

His Gly Ser Lys Arg Ala Thr Pro Val Val Asn Val Ile Ile Pro Glu
185 190 195

ATC GAT GGG TTT AAC AAC ACC CTG GAC CAT AGC ACG TGC GTA TCT TTT
853

Ile Asp Gly Phe Asn Asn Thr Leu Asp His Ser Thr Cys Val Ser Phe
200 205 210

GAG AAT GAT GAG CGG GCG GAT GAA ATT GAA GCC AAT TTC ACG GCA ATT
901

Glu Asn Asp Glu Arg Ala Asp Glu Ile Glu Ala Asn Phe Thr Ala Ile
215 220 225 230

ATG GGA CCT CCG ATC CGC AAA CGT CTG GAA AAT GAC CTC CCT GGC ATC
949

Met Gly Pro Pro Ile Arg Lys Arg Leu Glu Asn Asp Leu Pro Gly Ile
235 240 245

AAA CTT ACA AAC GAG AAT GTA ATA TAT TTG ATG GAT ATG TGC TCT TTC
997

Lys Leu Thr Asn Glu Asn Val Ile Tyr Leu Met Asp Met Cys Ser Phe
250 255 260

GAC ACC ATG GCG CGC ACC GCC CAC GGA ACC GAG CTG TCT CCA TTT TGT
1045

Asp Thr Met Ala Arg Thr Ala His Gly Thr Glu Leu Ser Pro Phe Cys
265 270 275

GCC ATC TTC ACT GAA AAG GAG TGG CTG CAG TAC GAC TAC CTT CAA TCT
1093

Ala Ile Phe Thr Glu Lys Glu Trp Leu Gln Tyr Asp Tyr Leu Gln Ser
280 285 290

CTA TCA AAG TAC TAC GGC TAC GGT GCC GGA AGC CCC CTT GGC CCA GCT
1141

Leu Ser Lys Tyr Tyr Gly Tyr Gly Ala Gly Ser Pro Leu Gly Pro Ala
295 300 305 310

CAG GGA ATT GGC TTC ACC AAC GAG CTG ATT GCC CGA CTA ACG CAA TCG
1189

Gln Gly Ile Gly Phe Thr Asn Glu Leu Ile Ala Arg Leu Thr Gln Ser
315 320 325

CCC GTC CAG GAC AAC ACA AGC ACC AAC CAC ACT CTA GAC TCG AAC CCA
1237

Pro Val Gln Asp Asn Thr Ser Thr Asn His Thr Leu Asp Ser Asn Pro
330 335 340

GCC ACA TTT CCG CTC GAC AGG AAG CTC TAC GCC GAC TTC TCC CAC GAC
1285

Ala Thr Phe Pro Leu Asp Arg Lys Leu Tyr Ala Asp Phe Ser His Asp
345 350 355

AAT AGC ATG ATA TCG ATA TTC TTC GCC ATG GGT CTG TAC AAC GGC ACC
1333

Asn Ser Met Ile Ser Ile Phe Phe Ala Met Gly Leu Tyr Asn Gly Thr
360 365 370

CAG CCG CTG TCA ATG GAT TCC GTG GAG TCG ATC CAG GAG ATG GAC GGT
1381

Gln Pro Leu Ser Met Asp Ser Val Glu Ser Ile Gln Glu Met Asp Gly
375 380 385 390

TAC GCG GCG TCT TGG ACT GTT CCG TTT GGT GCG AGG GCT TAC TTT GAG
1429

Tyr Ala Ala Ser Trp Thr Val Pro Phe Gly Ala Arg Ala Tyr Phe Glu
395 400 405

CTC ATG CAG TGC GAG AAG AAG GAG CCG CTT GTG CGG GTA TTA GTG AAT
1477

Leu Met Gln Cys Glu Lys Lys Glu Pro Leu Val Arg Val Leu Val Asn
410 415 420

GAT CGC GTT GTT CCT CTT CAT GGC TGC GCA GTT GAC AAG TTT GGA CGG
1525

Asp Arg Val Val Pro Leu His Gly Cys Ala Val Asp Lys Phe Gly Arg
425 430 435

TGC ACT TTG GAC GAT TGG GTA GAG GGC TTG AAT TTT GCA AGG AGC GGC
1573

Cys Thr Leu Asp Asp Trp Val Glu Gly Leu Asn Phe Ala Arg Ser Gly
440 445 450

GGG AAC TGG AAG ACT TGT TTT ACC CTA TAAAGGGCGT TTGCTCATTC
1620

Gly Asn Trp Lys Thr Cys Phe Thr Leu
455 460

ATAAGTGTTG TGCAGGTATA GGAAGGTTAG GGAATTAGCT GTTTGGCTTT ACTCTTATTA
1680

GACCAAGAAT GATTTGTTTG TTCTCAAGGC CTTCTAGCAT ATCGTCAAGT GGGATAAATC
1740

ACCTATCCTC CATGTGTAGG TGAACCCGCT CTTGCATCAA CCTCTTGTGT TTCAGAGTAG
1800

TTTCACCAAA CATATCCTCG TGTCCTCTCT TCTGCTCTTC GGTCTCATAT TACACTGTTC 
1860 

TCTATCTATA TCGTCAACAA AACTACCACC CAAACACCAA ATGTCACACT TTCCAGCACG 
1920 

AAATTTCTTC G 
1931 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Phe Phe Thr Val Ala Leu Ser Leu Tyr Tyr Leu Leu Ser Arg
1 5 10 15

Val Ser Ala Gln Ala Pro Val Val Gln Asn His Ser Cys Asn Thr Ala
20 25 30

Asp Gly Gly Tyr Gln Cys Phe Pro Asn Val Ser His Val Trp Gly Gln
35 40 45

Tyr Ser Pro Tyr Phe Ser Ile Glu Gln Glu Ser Ala Ile Ser Glu Asp
50 55 60

Val Pro His Gly Cys Glu Val Thr Phe Val Gln Val Leu Ser Arg His
65 70 75 80

Gly Ala Arg Tyr Pro Thr Glu Ser Lys Ser Lys Ala Tyr Ser Gly Leu
85 90 95

Ile Glu Ala Ile Gln Lys Asn Ala Thr Ser Phe Trp Gly Gln Tyr Ala
100 105 110

Phe Leu Glu Ser Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu Thr Ile
115 120 125

Phe Gly Glu Asn Gln Met Val Asp Ser Gly Ala Lys Phe Tyr Arg Arg
130 135 140

Tyr Lys Asn Leu Ala Arg Lys Asn Thr Pro Phe Ile Arg Ala Ser Gly
145 150 155 160

Ser Asp Arg Val Val Ala Ser Ala Glu Lys Phe Ile Asn Gly Phe Arg
165 170 175

Lys Ala Gln Leu His Asp His Gly Ser Lys Arg Ala Thr Pro Val Val
180 185 190

Asn Val Ile Ile Pro Glu Ile Asp Gly Phe Asn Asn Thr Leu Asp His
195 200 205

Ser Thr Cys Val Ser Phe Glu Asn Asp Glu Arg Ala Asp Glu Ile Glu
210 215 220

Ala Asn Phe Thr Ala Ile Met Gly Pro Pro Ile Arg Lys Arg Leu Glu
225 230 235 240

Asn Asp Leu Pro Gly Ile Lys Leu Thr Asn Glu Asn Val Ile Tyr Leu
245 250 255

Met Asp Met Cys Ser Phe Asp Thr Met Ala Arg Thr Ala His Gly Thr
260 265 270

Glu Leu Ser Pro Phe Cys Ala Ile Phe Thr Glu Lys Glu Trp Leu Gln
275 280 285

Tyr Asp Tyr Leu Gln Ser Leu Ser Lys Tyr Tyr Gly Tyr Gly Ala Gly
290 295 300

Ser Pro Leu Gly Pro Ala Gln Gly Ile Gly Phe Thr Asn Glu Leu Ile
305 310 315 320

Ala Arg Leu Thr Gln Ser Pro Val Gln Asp Asn Thr Ser Thr Asn His
325 330 335

Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asp Arg Lys Leu Tyr
340 345 350

Ala Asp Phe Ser His Asp Asn Ser Met Ile Ser Ile Phe Phe Ala Met
355 360 365

Gly Leu Tyr Asn Gly Thr Gln Pro Leu Ser Met Asp Ser Val Glu Ser
370 375 380

Ile Gln Glu Met Asp Gly Tyr Ala Ala Ser Trp Thr Val Pro Phe Gly
385 390 395 400

Ala Arg Ala Tyr Phe Glu Leu Met Gln Cys Glu Lys Lys Glu Pro Leu
405 410 415

Val Arg Val Leu Val Asn Asp Arg Val Val Pro Leu His Gly Cys Ala 
420 425 430 

Val Asp Lys Phe Gly Arg Cys Thr Leu Asp Asp Trp Val Glu Gly Leu 
435 440 445 

Asn Phe Ala Arg Ser Gly Gly Asn Trp Lys Thr Cys Phe Thr Leu 
450 455 460 
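
The one-letter translation printed beneath the nucleotide rows of Figure 4 appears to begin with the same residues as SEQ ID NO: 2. A minimal sketch of converting the three-letter notation of the listing into one-letter code is given below; the helper name `to_one_letter` is hypothetical and the example uses only the first sixteen residues.

```python
# Three-letter to one-letter residue codes for the twenty standard amino acids.
ONE_LETTER = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(residues: str) -> str:
    """Convert whitespace-separated three-letter residue names to one-letter code."""
    return "".join(ONE_LETTER[r] for r in residues.split())


first_row = "Met Ala Phe Phe Thr Val Ala Leu Ser Leu Tyr Tyr Leu Leu Ser Arg"
print(to_one_letter(first_row))  # MAFFTVALSLYYLLSR
```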

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(288..334, 390..1740)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTCCACGCTG AAAGCCTGAG TGCGATTTCC AAGCTGCATG CAGGCTGCTC AACTGCCTGC 
60 

TTATCTTCAT CAGACGCAGA TACACAACCT GGTCTGTAGA TGCACCCATG ACGGACGAAC 
120 
GCACCGCTCT CTTGGCCTCC AGGGACCCGG AGGTCGAGGG CGATGAGGTC GCGCCCTCGA
180

CGGCCTCCCA GTCCCTGTTG CAGTTGAGAT CTCGCTGCGA ACGTCGACCG CAGATATGGT
240

TGTCTTCGAC GTTTTCTCGC CTTCGAGGAA GAATTGCTGC TGTGACG ATG AGT CTG
296

Met Ser Leu
1

TTG TTG CTG GTG CTG TCC GGC GGG TTG GTC GCG TTA TA GTATGCTCCT
344

Leu Leu Leu Val Leu Ser Gly Gly Leu Val Ala Leu Tyr
5 10 15

TCTCTCTGGT CATATTGTTT TCTGCTAACG TTCTCATAAT TGAAG T GTC TCA AGA
399

Val Ser Arg

AAT CCG CAT GTT GAT AGC CAC TCT TGC AAT ACA GTG GAA GGA GGG TAT
447 

Asn Pro His Val Asp Ser His Ser Cys Asn Thr Val Glu Gly Gly Tyr 
20 25 30 35 






CAG TGT CGT CCA GAA ATC TCC CAC TCC TGG GGC CAG TAT TCT CCA TTC 
495 

Gln Cys Arg Pro Glu Ile Ser His Ser Trp Gly Gln Tyr Ser Pro Phe
40 45 50 

TTC TCC CTG GCA GAC CAG TCG GAG ATC TCG CCA GAT GTC CCA CAG AAC 
543 

Phe Ser Leu Ala Asp Gln Ser Glu Ile Ser Pro Asp Val Pro Gln Asn
55 60 65 



75 80 



TGC 


AAG 


ATT 


591 






Cys 


Lys 


He 






70 


CCT 


ACG 


TCT 


639 






Pro 


Thr 


Ser 




85 




CAG 


AAG 


ACG 


687 






Gin 


Lys 


Thr 


100 






TAC 


AGA 


TAC 


735 






Tyr 


Arg 


Tyr 


CAG 


ATG 


ATC 


783 






Gin 


Met 


He 


GCC 


AG6 


AAT 


831 






Ala 


Arg 


Asn 






150 


ATT 


GCC 


TCG 



90 95 



105 110 115 



120 125 130 



135 140 145 



155 160 
ATT GCC TCG GGG AGA CTT TTC ATC GAA GGT TTC CAG AGC GCC AAA GTG
879

Ile Ala Ser Gly Arg Leu Phe Ile Glu Gly Phe Gln Ser Ala Lys Val
165 170 175

CTG GAT CCT CAT TCA GAC AAG CAT GAC GCT CCT CCC ACG ATC AAC GTG
927

Leu Asp Pro His Ser Asp Lys His Asp Ala Pro Pro Thr Ile Asn Val
180 185 190 195 

ATC ATC GAG GAG GGT CCG TCC TAC AAT AAC ACG CTC GAC ACC GGC AGC 
975 

Ile Ile Glu Glu Gly Pro Ser Tyr Asn Asn Thr Leu Asp Thr Gly Ser
200 205 210 

TGT CCA GTC TTT GAG GAC AGC AGC GGG GGA CAT GAC GCA CAG GAA AAG 
1023 

Cys Pro Val Phe Glu Asp Ser Ser Gly Gly His Asp Ala Gln Glu Lys

215 220 225 

TTC GCA AAG CAA TTC GCA CCA GCT ATC CTG GAA AAG ATC AAG GAC CAT 
1071 

Phe Ala Lys Gln Phe Ala Pro Ala Ile Leu Glu Lys Ile Lys Asp His
230 235 240 

CTT CCC GGC GTG GAC CTG GCC GTG TCG GAT GTA CCG TAC TTG ATG GAC 
1119 

Leu Pro Gly Val Asp Leu Ala Val Ser Asp Val Pro Tyr Leu Met Asp 
245 250 255 

TTG TGT CCG TTT GAG ACC TTG GCT CGC AAC CAC ACA GAC ACG CTG TCT 
1167 

Leu Cys Pro Phe Glu Thr Leu Ala Arg Asn His Thr Asp Thr Leu Ser 
260 265 270 275 

CCG TTC TGC GCT CTT TCC ACG CAA GAG GAG TGG CAA GCA TAT GAC TAC 
1215 

Pro Phe Cys Ala Leu Ser Thr Gln Glu Glu Trp Gln Ala Tyr Asp Tyr
280 285 290 

TAC CAA AGT CTG GGG AAA TAC TAT GGC AAT GGC GGG GGT AAC CCG TTG 
1263 

Tyr Gln Ser Leu Gly Lys Tyr Tyr Gly Asn Gly Gly Gly Asn Pro Leu
295 300 305 

GGG CCA GCC CAA GGC GTG GGG TTT GTC AAC GAG TTG ATT GCT CGC ATG 
1311 

Gly Pro Ala Gln Gly Val Gly Phe Val Asn Glu Leu Ile Ala Arg Met
310 315 320 

ACC CAT AGC CCT GTC CAG GAC TAC ACC ACG GTC AAC CAC ACT CTT GAC 
1359 

Thr His Ser Pro Val Gln Asp Tyr Thr Thr Val Asn His Thr Leu Asp
325 330 335 

TCG AAT CCG GCG ACA TTC CCT TTG AAC GCG ACG CTG TAC GCA GAT TTC 
1407 

Ser Asn Pro Ala Thr Phe Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe 
340 345 350 355 

AGC CAC GAC AAC ACA ATG ACG TCA ATT TTC GCG GCC TTG GGC CTG TAC 
1455 

Ser His Asp Asn Thr Met Thr Ser Ile Phe Ala Ala Leu Gly Leu Tyr
360 365 370 

AAC GGG ACC GCG AAG CTG TCC ACG ACC GAG ATC AAG TCC ATT GAA GAG 
1503 

Asn Gly Thr Ala Lys Leu Ser Thr Thr Glu Ile Lys Ser Ile Glu Glu
375 380 385 

ACG GAC GGC TAC TCG GCG GCG TGG ACC GTT CCG TTC GGG GGG CGA GCC 
1551 

Thr Asp Gly Tyr Ser Ala Ala Trp Thr Val Pro Phe Gly Gly Arg Ala 
390 395 400 
TAT ATC GAG ATG ATG CAG TGT GAT GAT TCG GAT GAG CCA GTC GTT CGG
1599

Tyr Ile Glu Met Met Gln Cys Asp Asp Ser Asp Glu Pro Val Val Arg
405 410 415

GTG CTG GTC AAC GAC CGG GTG GTG CCA CTG CAT GGC TGC GAG GTG GAC 
1647 

Val Leu Val Asn Asp Arg Val Val Pro Leu His Gly Cys Glu Val Asp 
420 425 430 435 

TCC CTG GGG CGA TGC AAA CGA GAC GAC TTT GTC AGG GGA CTG AGT TTT 
1695 

Ser Leu Gly Arg Cys Lys Arg Asp Asp Phe Val Arg Gly Leu Ser Phe 
440 445 450 

GCG CGA CAG GGT GGG AAC TGG GAG GGG TGT TAC GCT GCT TCT GAG 
1740 

Ala Arg Gln Gly Gly Asn Trp Glu Gly Cys Tyr Ala Ala Ser Glu
455 460 465 

TAGGTTTATT CAGCGAGTTT CGACCTTTCT ATCCTTCAAA CACTGCACAA AGACACACTG 
1800 

CATGAAATGG TAACAGGCCT GGAGCGTTTT AGAAGGAAAA AAGTT 
1845 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 466 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ser Leu Leu Leu Leu Val Leu Ser Gly Gly Leu Val Ala Leu Tyr
1 5 10 15

Val Ser Arg Asn Pro His Val Asp Ser His Ser Cys Asn Thr Val Glu
20 25 30

Gly Gly Tyr Gln Cys Arg Pro Glu Ile Ser His Ser Trp Gly Gln Tyr
35 40 45

Ser Pro Phe Phe Ser Leu Ala Asp Gln Ser Glu Ile Ser Pro Asp Val
50 55 60

Pro Gln Asn Cys Lys Ile Thr Phe Val Gln Leu Leu Ser Arg His Gly
65 70 75 80

Ala Arg Tyr Pro Thr Ser Ser Lys Thr Glu Leu Tyr Ser Gln Leu Ile
85 90 95

Ser Arg Ile Gln Lys Thr Ala Thr Ala Tyr Lys Gly Tyr Tyr Ala Phe
100 105 110

Leu Lys Asp Tyr Arg Tyr Gln Leu Gly Ala Asn Asp Leu Thr Pro Phe
115 120 125

Gly Glu Asn Gln Met Ile Gln Leu Gly Ile Lys Phe Tyr Asn His Tyr
130 135 140

Lys Ser Leu Ala Arg Asn Ala Val Pro Phe Val Arg Cys Ser Gly Ser
145 150 155 160

Asp Arg Val Ile Ala Ser Gly Arg Leu Phe Ile Glu Gly Phe Gln Ser
165 170 175

Ala Lys Val Leu Asp Pro His Ser Asp Lys His Asp Ala Pro Pro Thr
180 185 190

Ile Asn Val Ile Ile Glu Glu Gly Pro Ser Tyr Asn Asn Thr Leu Asp
195 200 205

Thr Gly Ser Cys Pro Val Phe Glu Asp Ser Ser Gly Gly His Asp Ala
210 215 220

Gln Glu Lys Phe Ala Lys Gln Phe Ala Pro Ala Ile Leu Glu Lys Ile
225 230 235 240

Lys Asp His Leu Pro Gly Val Asp Leu Ala Val Ser Asp Val Pro Tyr
245 250 255

Leu Met Asp Leu Cys Pro Phe Glu Thr Leu Ala Arg Asn His Thr Asp
260 265 270

Thr Leu Ser Pro Phe Cys Ala Leu Ser Thr Gln Glu Glu Trp Gln Ala
275 280 285

Tyr Asp Tyr Tyr Gln Ser Leu Gly Lys Tyr Tyr Gly Asn Gly Gly Gly
290 295 300

Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Val Asn Glu Leu Ile
305 310 315 320

Ala Arg Met Thr His Ser Pro Val Gln Asp Tyr Thr Thr Val Asn His
325 330 335

Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn Ala Thr Leu Tyr
340 345 350

Ala Asp Phe Ser His Asp Asn Thr Met Thr Ser Ile Phe Ala Ala Leu
355 360 365

Gly Leu Tyr Asn Gly Thr Ala Lys Leu Ser Thr Thr Glu Ile Lys Ser
370 375 380

Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ala Trp Thr Val Pro Phe Gly
385 390 395 400

Gly Arg Ala Tyr Ile Glu Met Met Gln Cys Asp Asp Ser Asp Glu Pro
405 410 415

Val Val Arg Val Leu Val Asn Asp Arg Val Val Pro Leu His Gly Cys
420 425 430

Glu Val Asp Ser Leu Gly Arg Cys Lys Arg Asp Asp Phe Val Arg Gly
435 440 445

Leu Ser Phe Ala Arg Gln Gly Gly Asn Trp Glu Gly Cys Tyr Ala Ala
450 455 460

Ser Glu
465
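
The CDS feature of SEQ ID NO: 3, join(288..334, 390..1740), can be cross-checked against the 466-residue length stated for SEQ ID NO: 4. The sketch below is illustrative only: `cds_length` is a hypothetical helper, and it assumes the location uses the plain `join(a..b, c..d)` form with 1-based inclusive coordinates.

```python
import re


def cds_length(location: str) -> int:
    """Total number of nucleotides covered by a join(a..b, c..d) location."""
    spans = re.findall(r"(\d+)\.\.(\d+)", location)
    return sum(int(end) - int(start) + 1 for start, end in spans)


n = cds_length("join(288..334, 390..1740)")
print(n, n // 3, n % 3)  # 1398 466 0
```

The first exon contributes 47 nucleotides, so one codon is split across the intron, and the 1398 coding nucleotides correspond to 466 codons with the stop codon lying outside the annotated range.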

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
(A) NAME/KEY: CDS

(B) LOCATION: join(43..89, 147..1494)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

AGATTCAACG ACGGAGGAAT CGCAACCCTA ATTGTCGGTA TC ATG GTG ACT CTG 
54 

Met Val Thr Leu 

1

ACT TTC CTG CTT TCG GCG GCG TAT CTG CTT TCT GG GTGAGTGGCT 
99 

Thr Phe Leu Leu Ser Ala Ala Tyr Leu Leu Ser Gly
5 10 15 

TGGATCTATT GCTCGGATAG GGCTGTGGTG CTGATTCTGA AACGGAG T AGA GTG 
153 

Arg Val 

TCT GCG GCA CCT ACT TCT GCT GGC TCC AAG TCC TGC GAT ACG GTA GAC 
201 

Ser Ala Ala Pro Ser Ser Ala Gly Ser Lys Ser Cys Asp Thr Val Asp 
20 25 30

CTC GGG TAC CAG TGC TCC CCT GCG ACT TCT CAT CTA TGG GGC CAG TAC 
249 

Leu Gly Tyr Gln Cys Ser Pro Ala Thr Ser His Leu Trp Gly Gln Tyr
35 40 45 50 

TCG CCA TTC TTT TCG CTC GAG GAC GAG CTG TCC GTG TCG AGT AAG CTT 
297 

Ser Pro Phe Phe Ser Leu Glu Asp Glu Leu Ser Val Ser Ser Lys Leu 
55 60 65 

CCC AAG GAT TGC CGG ATC ACC TTG GTA CAG GTG CTA TCG CGC CAT GGA 
345 

Pro Lys Asp Cys Arg Ile Thr Leu Val Gln Val Leu Ser Arg His Gly
70 75 80 

GCG CGG TAC CCA ACC AGC TCC AAG AGC AAA AAG TAT AAG AAG CTT GTG 
393 

Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Lys Tyr Lys Lys Leu Val 
85 90 95 

ACG GCG ATC CAG GCC AAT GCC ACC GAC TTC AAG GGC AAG TTT GCC TTT 
441 

Thr Ala Ile Gln Ala Asn Ala Thr Asp Phe Lys Gly Lys Phe Ala Phe
100 105 110 

TTG AAG ACG TAC AAC TAT ACT CTG GGT GCG GAT GAC CTC ACT CCC TTT 
489 

Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu Thr Pro Phe 
115 120 125 130 

GGG GAG CAG CAG CTG GTG AAC TCG GGC ATC AAG TTC TAC CAG AGG TAC
537

Gly Glu Gln Gln Leu Val Asn Ser Gly Ile Lys Phe Tyr Gln Arg Tyr
135 140 145 

AAG GCT CTG GCG CGC AGT GTG GTG CCG TTT ATT CGC GCC TCA GGC TCG 
585 

Lys Ala Leu Ala Arg Ser Val Val Pro Phe Ile Arg Ala Ser Gly Ser
150 155 160 

GAC CGG GTT ATT GCT TCG GGA GAG AAG TTC ATC GAG GGG TTC CAG CAG 
633 

Asp Arg Val Ile Ala Ser Gly Glu Lys Phe Ile Glu Gly Phe Gln Gln
165 170 175



GCG AAG CTG GCT GAT CCT GGC GCG ACG AAC CGC GCC GCT CCG GCG ATT 
681 

Ala Lys Leu Ala Asp Pro Gly Ala Thr Asn Arg Ala Ala Pro Ala Ile
180 185 190 

AGT GTG ATT ATT CCG GAG AGC GAG ACG TTC AAC AAT ACG CTG GAC CAC 
729 

Ser Val Ile Ile Pro Glu Ser Glu Thr Phe Asn Asn Thr Leu Asp His
195 200 205 210 

GGT GTG TGC ACG AAG TTT GAG GCG AGT CAG CTG GGA GAT GAG GTT GCG 
777 

Gly Val Cys Thr Lys Phe Glu Ala Ser Gln Leu Gly Asp Glu Val Ala
215 220 225 

GCC AAT TTC ACT GCG CTC TTT GCA CCC GAC ATC CGA GCT CGC GCC GAG 
825 

Ala Asn Phe Thr Ala Leu Phe Ala Pro Asp Ile Arg Ala Arg Ala Glu
230 235 240 



250 255 



AAG 


CAT 


CTT 


873 






Lys 


His 


Leu 






245 


ATG 


GAC 


ATG 


921 






Met 


Asp 


Met 




260 




CAG 


CTG 


TCA 


969 






Gin 


Leu 


Ser 


275 






TAC 


AAC 


TAC 


1017 




Tyr 


Asn 


Tyr 


AAC 


CCT 


CTG 


1065 




Asn 


Pro 


Leu 


GCC 


CGG 


TTG 


1113 




Ala 


Arg 


Leu 






325 


ACT 


CTA 


GTC 


1161 




Thr 


Leu 


Val 




340 





265 270 



280 285 290 



295 300 305 



310 315 320 



330 335 



345 350 



GTC GAC TTT TCA CAC GAC AAC AGC ATG GTT TCC ATC TTC TTT GCA TTG 
1209 

Val Asp Phe Ser His Asp Asn Ser Met Val Ser Ile Phe Phe Ala Leu
355 360 365 370

GGC CTG TAC AAC GGC ACT GAA CCC TTG TCC CGG ACC TCG GTG GAA AGC 
1257 

Gly Leu Tyr Asn Gly Thr Glu Pro Leu Ser Arg Thr Ser Val Glu Ser 
375 380 385 

GCC AAG GAA TTG GAT GGG TAT TCT GCA TCC TGG GTG GTG CCT TTC GGC
1305

Ala Lys Glu Leu Asp Gly Tyr Ser Ala Ser Trp Val Val Pro Phe Gly 
390 395 400 
GCG CGA GCC TAC TTC GAG ACG ATG CAA TGC AAG TCG GAA AAG GAG CCT
1353

Ala Arg Ala Tyr Phe Glu Thr Met Gln Cys Lys Ser Glu Lys Glu Pro
405 410 415

CTT GTT CGC GCT TTG ATT AAT GAC CGG GTT GTG CCA CTG CAT GGC TGC 
1401 

Leu Val Arg Ala Leu Ile Asn Asp Arg Val Val Pro Leu His Gly Cys
420 425 430 

GAT GTG GAC AAG CTG GGG CGA TGC AAG CTG AAT GAC TTT GTC AAG GGA 
1449

Asp Val Asp Lys Leu Gly Arg Cys Lys Leu Asn Asp Phe Val Lys Gly 
435 440 445 450

TTG AGT TGG GCC AGA TCT GGG GGC AAC TGG GGA GAG TGC TTT AGT 
1494 

Leu Ser Trp Ala Arg Ser Gly Gly Asn Trp Gly Glu Cys Phe Ser 
455 460 465

TGAGATGTCA TTGTTATGCT ATACTCCAAT AGACCGTTGC TTAGCCATTC ACTTCACTTT 
1554 






GCTCGAACCG CCTGCCG 
1571 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Val Thr Leu Thr Phe Leu Leu Ser Ala Ala Tyr Leu Leu Ser Gly
1 5 10 15

Arg Val Ser Ala Ala Pro Ser Ser Ala Gly Ser Lys Ser Cys Asp Thr
20 25 30

Val Asp Leu Gly Tyr Gln Cys Ser Pro Ala Thr Ser His Leu Trp Gly
35 40 45

Gln Tyr Ser Pro Phe Phe Ser Leu Glu Asp Glu Leu Ser Val Ser Ser
50 55 60

Lys Leu Pro Lys Asp Cys Arg Ile Thr Leu Val Gln Val Leu Ser Arg
65 70 75 80

His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Lys Tyr Lys Lys
85 90 95

Leu Val Thr Ala Ile Gln Ala Asn Ala Thr Asp Phe Lys Gly Lys Phe
100 105 110

Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu Thr
115 120 125

Pro Phe Gly Glu Gln Gln Leu Val Asn Ser Gly Ile Lys Phe Tyr Gln
130 135 140

Arg Tyr Lys Ala Leu Ala Arg Ser Val Val Pro Phe Ile Arg Ala Ser
145 150 155 160

Gly Ser Asp Arg Val Ile Ala Ser Gly Glu Lys Phe Ile Glu Gly Phe
165 170 175

Gln Gln Ala Lys Leu Ala Asp Pro Gly Ala Thr Asn Arg Ala Ala Pro
180 185 190

Ala Ile Ser Val Ile Ile Pro Glu Ser Glu Thr Phe Asn Asn Thr Leu
195 200 205

Asp His Gly Val Cys Thr Lys Phe Glu Ala Ser Gln Leu Gly Asp Glu
210 215 220

Val Ala Ala Asn Phe Thr Ala Leu Phe Ala Pro Asp Ile Arg Ala Arg
225 230 235 240

Ala Glu Lys His Leu Pro Gly Val Thr Leu Thr Asp Glu Asp Val Val
245 250 255

Ser Leu Met Asp Met Cys Ser Phe Asp Thr Val Ala Arg Thr Ser Asp
260 265 270

Ala Ser Gln Leu Ser Pro Phe Cys Gln Leu Phe Thr His Asn Glu Trp
275 280 285

Lys Lys Tyr Asn Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly Tyr Gly
290 295 300

Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Ile Gly Phe Thr Asn Glu
305 310 315 320

Leu Ile Ala Arg Leu Thr Arg Ser Pro Val Gln Asp His Thr Ser Thr
325 330 335

Asn Ser Thr Leu Val Ser Asn Pro Ala Thr Phe Pro Leu Asn Ala Thr
340 345 350

Met Tyr Val Asp Phe Ser His Asp Asn Ser Met Val Ser Ile Phe Phe
355 360 365

Ala Leu Gly Leu Tyr Asn Gly Thr Glu Pro Leu Ser Arg Thr Ser Val
370 375 380

Glu Ser Ala Lys Glu Leu Asp Gly Tyr Ser Ala Ser Trp Val Val Pro
385 390 395 400

Phe Gly Ala Arg Ala Tyr Phe Glu Thr Met Gln Cys Lys Ser Glu Lys
405 410 415

Glu Pro Leu Val Arg Ala Leu Ile Asn Asp Arg Val Val Pro Leu His
420 425 430

Gly Cys Asp Val Asp Lys Leu Gly Arg Cys Lys Leu Asn Asp Phe Val
435 440 445

Lys Gly Leu Ser Trp Ala Arg Ser Gly Gly Asn Trp Gly Glu Cys Phe
450 455 460

Ser
465

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 






(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
(A) NAME/KEY: CDS

(B) LOCATION: join(78..124, 177..1527)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

ACGTCCCAGG TCGGGGACTA CATCCGCTAT GTGGTCCTCT ACTTCGTCGG AAGAATATAC 
60 

TGTCTCTTGT GGCTACC ATG GGG GTT TTC GTC GTT CTA TTA TCT ATC GCG

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala
1 5 10

ACT CTG TTC GGC AG GTATGTGCAC CGCTCTAGGT TCAACTCGCC TGGTAACTGA
164

Thr Leu Phe Gly Ser
15

CAAACAGCAC AG C ACA TCG GGC ACT GCG CTG GGC CCC CGT GGA AAT CAC
213

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn His
20 25

AGC GAC TGC ACC TCA GTC GAC CGG GGG TAT CAA TGC TTC CCT GAG CTC
261

Ser Asp Cys Thr Ser Val Asp Arg Gly Tyr Gln Cys Phe Pro Glu Leu
30 35 40

TCC CAT AAA TGG GGT CTC TAC GCG CCC TAT TTC TCC CTC CAG GAT GAA
309

Ser His Lys Trp Gly Leu Tyr Ala Pro Tyr Phe Ser Leu Gln Asp Glu
45 50 55 60

TCT CCG TTT CCT CTG GAC GTC CCG GAT GAC TGC CAC ATC ACC TTT GTG
357

Ser Pro Phe Pro Leu Asp Val Pro Asp Asp Cys His Ile Thr Phe Val
65 70 75

CAG GTG CTG GCC CGA CAT GGA GCG CGG TCT CCA ACC GAT AGC AAG ACA
405

Gln Val Leu Ala Arg His Gly Ala Arg Ser Pro Thr Asp Ser Lys Thr
80 85 90

AAG GCG TAT GCC GCG ACT ATT GCA GCC ATC CAG AAG AAT GCC ACC GCG
453

Lys Ala Tyr Ala Ala Thr Ile Ala Ala Ile Gln Lys Asn Ala Thr Ala
95 100 105

TTG CCG GGC AAA TAC GCC TTC CTG AAG TCG TAC AAT TAC TCC ATG GGC
501

Leu Pro Gly Lys Tyr Ala Phe Leu Lys Ser Tyr Asn Tyr Ser Met Gly
110 115 120

TCC GAG AAC CTG AAC CCC TTC GGG CGG AAC CAA CTG CAA GAT CTG GGC
549

Ser Glu Asn Leu Asn Pro Phe Gly Arg Asn Gln Leu Gln Asp Leu Gly
125 130 135 140

GCC CAG TTC TAC CGT CGC TAC GAC ACC CTC ACC CGG CAC ATC AAC CCT
597

Ala Gln Phe Tyr Arg Arg Tyr Asp Thr Leu Thr Arg His Ile Asn Pro
145 150 155

TTC GTC CGG GCC GCG GAT TCC TCC CGC GTC CAC GAA TCA GCC GAG AAG
645

Phe Val Arg Ala Ala Asp Ser Ser Arg Val His Glu Ser Ala Glu Lys
160 165 170

TTC GTC GAG GGC TTC CAA AAC GCC CGC CAA GGC GAT CCT CAC GCC AAC
693

Phe Val Glu Gly Phe Gln Asn Ala Arg Gln Gly Asp Pro His Ala Asn
175 180 185

CCT CAC CAG CCG TCG CCG CGC GTG GAT GTA GTC ATC CCC GAA GGC ACC
741

Pro His Gln Pro Ser Pro Arg Val Asp Val Val Ile Pro Glu Gly Thr
190 195 200

GCC TAC AAC AAC ACG CTC GAG CAC AGC ATC TGC ACC GCC TTC GAG GCC
789

Ala Tyr Asn Asn Thr Leu Glu His Ser Ile Cys Thr Ala Phe Glu Ala
205 210 215 220

AGC ACC GTC GGC GAC GCC GCG GCA GAC AAC TTC ACT GCC GTG TTC GCG
837

Ser Thr Val Gly Asp Ala Ala Ala Asp Asn Phe Thr Ala Val Phe Ala
225 230 235

CCG GCG ATC GCC AAG CGT CTG GAG GCC GAT CTG CCC GGC GTG CAG CTG
885

Pro Ala Ile Ala Lys Arg Leu Glu Ala Asp Leu Pro Gly Val Gln Leu
240 245 250

TCC GCC GAC GAC GTG GTC AAT CTG ATG GCC ATG TGT CCG TTC GAG ACG
933

Ser Ala Asp Asp Val Val Asn Leu Met Ala Met Cys Pro Phe Glu Thr
255 260 265

GTC AGC CTG ACC GAC GAC GCG CAC ACG CTG TCG CCG TTC TGC GAC CTC
981

Val Ser Leu Thr Asp Asp Ala His Thr Leu Ser Pro Phe Cys Asp Leu
270 275 280

TTC ACC GCC GCC GAG TGG ACG CAG TAC AAC TAC CTG CTC TCG CTG GAC
1029

Phe Thr Ala Ala Glu Trp Thr Gln Tyr Asn Tyr Leu Leu Ser Leu Asp
285 290 295 300

AAG TAC TAC GGC TAC GGC GGC GGC AAT CCG CTG GGC CCC GTG CAG GGC
1077

Lys Tyr Tyr Gly Tyr Gly Gly Gly Asn Pro Leu Gly Pro Val Gln Gly
305 310 315

GTG GGC TGG GCG AAC GAG CTG ATC GCG CGG CTG ACG CGC TCC CCC GTC
1125

Val Gly Trp Ala Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro Val
320 325 330

CAC GAC CAC ACC TGC GTC AAC AAC ACC CTC GAC GCC AAC CCG GCC ACC
1173

His Asp His Thr Cys Val Asn Asn Thr Leu Asp Ala Asn Pro Ala Thr
335 340 345

TTC CCG CTG AAC GCC ACC CTC TAC GCG GAC TTT TCG CAC GAC AGT AAC
1221

Phe Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Ser Asn
350 355 360

CTG GTG TCG ATC TTT TGG GCG GGT CTG TAC AAC GGC ACC AAG CCC
1269

Leu Val Ser Ile Phe Trp Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro
365 370 375 380

CTG TCG CAG ACC ACC GTG GAG GAT ATC ACC CGG ACG GAC GGG TAC GCG
1317

Leu Ser Gln Thr Thr Val Glu Asp Ile Thr Arg Thr Asp Gly Tyr Ala
385 390 395

GCC GCC TGG ACG GTG CCG TTT GCC GCC CGC GCC TAC ATC GAG ATG ATG
1365

Ala Ala Trp Thr Val Pro Phe Ala Ala Arg Ala Tyr Ile Glu Met Met
400 405 410

CAG TGT CGC GCG GAG AAG CAG CCG CTG GTG CGC GTG CTG GTC AAC GAC
1413

Gln Cys Arg Ala Glu Lys Gln Pro Leu Val Arg Val Leu Val Asn Asp
415 420 425

CGT GTC ATG CCG CTG CAC GGC TGC GCG GTG GAT AAT CTG GGC AGG TGT
1461

Arg Val Met Pro Leu His Gly Cys Ala Val Asp Asn Leu Gly Arg Cys
430 435 440

AAA CGG GAC GAC TTT GTG GAG GGA CTG AGC TTT GCG CGG GCA GGA GGG
1509

Lys Arg Asp Asp Phe Val Glu Gly Leu Ser Phe Ala Arg Ala Gly Gly
445 450 455 460

AAC TGG GCC GAG TGT TTC TGATGTACAT GCTGTAGTTA GCTTTGAGTC
1557

Asn Trp Ala Glu Cys Phe
465

CTGAGGTACC
1567

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 466 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
1 5 10 15

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn His Ser Asp Cys Thr
20 25 30

Ser Val Asp Arg Gly Tyr Gln Cys Phe Pro Glu Leu Ser His Lys Trp
35 40 45

Gly Leu Tyr Ala Pro Tyr Phe Ser Leu Gln Asp Glu Ser Pro Phe Pro
50 55 60

Leu Asp Val Pro Asp Asp Cys His Ile Thr Phe Val Gln Val Leu Ala
65 70 75 80

Arg His Gly Ala Arg Ser Pro Thr Asp Ser Lys Thr Lys Ala Tyr Ala
85 90 95

Ala Thr Ile Ala Ala Ile Gln Lys Asn Ala Thr Ala Leu Pro Gly Lys
100 105 110

Tyr Ala Phe Leu Lys Ser Tyr Asn Tyr Ser Met Gly Ser Glu Asn Leu
115 120 125

Asn Pro Phe Gly Arg Asn Gln Leu Gln Asp Leu Gly Ala Gln Phe Tyr
130 135 140

Arg Arg Tyr Asp Thr Leu Thr Arg His Ile Asn Pro Phe Val Arg Ala
145 150 155 160

Ala Asp Ser Ser Arg Val His Glu Ser Ala Glu Lys Phe Val Glu Gly
165 170 175

Phe Gln Asn Ala Arg Gln Gly Asp Pro His Ala Asn Pro His Gln Pro
180 185 190

Ser Pro Arg Val Asp Val Val Ile Pro Glu Gly Thr Ala Tyr Asn Asn
195 200 205

Thr Leu Glu His Ser Ile Cys Thr Ala Phe Glu Ala Ser Thr Val Gly
210 215 220

Asp Ala Ala Ala Asp Asn Phe Thr Ala Val Phe Ala Pro Ala Ile Ala
225 230 235 240

Lys Arg Leu Glu Ala Asp Leu Pro Gly Val Gln Leu Ser Ala Asp Asp
245 250 255

Val Val Asn Leu Met Ala Met Cys Pro Phe Glu Thr Val Ser Leu Thr
260 265 270

Asp Asp Ala His Thr Leu Ser Pro Phe Cys Asp Leu Phe Thr Ala Ala
275 280 285

Glu Trp Thr Gln Tyr Asn Tyr Leu Leu Ser Leu Asp Lys Tyr Tyr Gly
290 295 300

Tyr Gly Gly Gly Asn Pro Leu Gly Pro Val Gln Gly Val Gly Trp Ala
305 310 315 320

Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro Val His Asp His Thr
325 330 335

Cys Val Asn Asn Thr Leu Asp Ala Asn Pro Ala Thr Phe Pro Leu Asn
340 345 350

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Ser Asn Leu Val Ser Ile
355 360 365

Phe Trp Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Gln Thr
370 375 380

Thr Val Glu Asp Ile Thr Arg Thr Asp Gly Tyr Ala Ala Ala Trp Thr
385 390 395 400

Val Pro Phe Ala Ala Arg Ala Tyr Ile Glu Met Met Gln Cys Arg Ala
405 410 415

Glu Lys Gln Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Met Pro
420 425 430

Leu His Gly Cys Ala Val Asp Asn Leu Gly Arg Cys Lys Arg Asp Asp
435 440 445

Phe Val Glu Gly Leu Ser Phe Ala Arg Ala Gly Gly Asn Trp Ala Glu
450 455 460

Cys Phe
465

EP 0 897 010 A2

Claims

1. A process for the production of a modified phytase with improved activity properties characterized therein that the
following steps are effected:

a) the three dimensional structure of the phytase to be modified and, optionally of another phytase with activity
properties which are more favorable than the ones of the phytase to be modified is/are computer modelled on 
the basis of the three dimensional structure of the phytase of Aspergillus niger; 

b) the structure of the active sites of the phytase to be modified and of the phytase with the more favorable 
activity properties are compared and those amino acid residues in both active sites which are different are 
identified; 

c) a DNA sequence coding for a modified phytase is constructed by changing the nucleotides coding for at 
least one of the amino acids by which both active sites differ; 

d) integrating such a DNA sequence into a vector capable of expression in a suitable host cell; 

e) transforming a suitable host cell by the DNA sequence of c) or the vector of d), growing said host cell under
suitable growth conditions and isolating the modified phytase from the host cell or the culture medium by
methods known in the state of the art.
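
Steps b) and c) amount to a position-by-position comparison of two aligned phytases, restricted to the active-site residues. A minimal sketch of that comparison follows; the function name, the toy aligned fragments and the chosen positions are hypothetical and illustrate only the comparison, not the structural modelling of step a).

```python
def differing_sites(target: str, reference: str,
                    positions: list[int]) -> list[tuple[int, str, str]]:
    """Return (position, target_residue, reference_residue) where the sequences differ.

    Both sequences are one-letter strings of equal length taken from an
    alignment; positions are 1-based alignment columns of active-site residues.
    """
    if len(target) != len(reference):
        raise ValueError("aligned sequences must have equal length")
    return [(p, target[p - 1], reference[p - 1])
            for p in positions
            if target[p - 1] != reference[p - 1]]


# Hypothetical aligned fragments (not taken from the sequence listing).
target_fragment = "HSCNTQDGGYQC"
reference_fragment = "HSCDTLDGGYQC"

for pos, old, new in differing_sites(target_fragment, reference_fragment, [4, 6, 10]):
    print(f"{old}{pos}{new}")
```

Each reported difference is a candidate substitution for step c), written in the same residue-position-residue form used in the later claims.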

2. A process as claimed in claim 1 wherein the phytase to be modified is of eukaryotic origin. 

3. A process as claimed in claim 1 or 2 wherein the phytase to be modified is of fungal origin. 

4. A process as claimed in any one of claims 1 to 3 wherein the phytase to be modified is of Aspergillus origin.

5. A process as claimed in any one of claims 1 to 4 wherein the phytase to be modified is a phytase from Aspergillus 
fumigatus. 

6. A process as claimed in any one of claims 1 to 5 wherein the phytase with the more favorable activity properties is 
of eukaryotic origin. 

7. A process as claimed in any one of claims 1 to 6 wherein the phytase with the more favorable activity properties is
of fungal origin.

8. A process as claimed in any one of claims 1 to 7 wherein the phytase with the more favorable activity properties is of
Aspergillus origin.

9. A process as claimed in any one of claims 1 to 6 wherein the phytase with the more favorable activity properties is
a phytase from Aspergillus terreus.

10. A process as claimed in any one of claims 1 to 9 wherein the phytase to be modified is a phytase of Aspergillus
fumigatus and the phytase with the more favorable activity properties is the Aspergillus niger phytase.

11. A process as claimed in any one of claims 1 to 9 wherein the phytase to be modified is a phytase of Aspergillus
fumigatus and the phytase with the more favorable activity properties is the Aspergillus terreus phytase.

12. A modified phytase obtained or obtainable by a process as claimed in any one of claims 1 to 11.

13. A phytase which has been modified in a way that its activity property is more favorable than the one of the
non-modified phytase.

14. A phytase according to claim 13 characterized therein that the amino acid sequence of the non-modified phytase
has been changed by deletion, substitution and/or addition by one or more amino acids.

15. A phytase according to any one of claims 13 or 14 wherein changes have been made at at least one position which
is homologous to one of the following positions of the amino acid sequence of the phytase of Aspergillus niger: 27,
66, 71, 103, 140, 141, 188, 205, 234, 235, 238, 274, 277, 282, 340 and/or 424, preferably 27, 66, 140, 205, 274,
277, 282 and/or 340.

16. A phytase according to any one of claims 13 to 15 which is the phytase of Aspergillus, preferably Aspergillus
fumigatus.

17. A phytase according to any one of claims 15 or 16 wherein at position 27 or at least at position 27 a change occurs.

18. A phytase according to claim 17 wherein the amino acid at position 27 is replaced by one selected from one of the
following groups:

a) Ala, Val, Leu, Ile; or

b) Thr,

19. A phytase according to claim 17 or 18 wherein in addition to position 27 a change occurs also at position 66.

20. A phytase according to claim 17 or 18 wherein in addition to position 27 a change occurs also at position 140.

21. A phytase according to any one of claims 13 to 20 which is characterized by at least one of the following mutations:
Q27L, Q27N, Q27T, Q27I, Q27V, Q27A, Q27G, S66D, S140Y, D141Q, A205E, Q274L, G277D, G277K, Y282H
and/or N340S.
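
The mutations in claim 21 are written as original residue, position, replacement residue, for example S66D. A minimal sketch of parsing and applying such a label is given below; `parse_mutation`, `apply_mutation` and the short toy sequence are hypothetical, and real use would need the numbering of the Aspergillus niger phytase referred to in claim 15 rather than plain string indices.

```python
import re


def parse_mutation(label: str) -> tuple[str, int, str]:
    """Split a substitution label such as 'S66D' into (old, position, new)."""
    m = re.fullmatch(r"([A-Z])(\d+)([A-Z])", label)
    if m is None:
        raise ValueError(f"not a substitution label: {label!r}")
    return m.group(1), int(m.group(2)), m.group(3)


def apply_mutation(sequence: str, label: str) -> str:
    """Return the sequence with the substitution applied at a 1-based position."""
    old, pos, new = parse_mutation(label)
    if sequence[pos - 1] != old:
        raise ValueError(f"expected {old} at position {pos}, found {sequence[pos - 1]}")
    return sequence[:pos - 1] + new + sequence[pos:]


toy = "MASQT"  # hypothetical five-residue sequence, for illustration only
print(parse_mutation("S66D"))      # ('S', 66, 'D')
print(apply_mutation(toy, "Q4L"))  # MASLT
```

Checking the original residue before substituting guards against applying a label to a sequence with a different numbering.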

22. A DNA sequence comprising a DNA sequence coding for a phytase as claimed in any one of claims 12 to 21.

23. A vector comprising the DNA sequence of claim 22.

24. The vector of claim 23 which is an expression vector.

25. A host cell which has been transformed by a DNA sequence as claimed in claim 22 or the vector of claim 23 or 24.

26. A process for the preparation of a phytase as claimed in any one of claims 12 to 21 wherein the host cell as claimed
in claim 25 is cultured under suitable culture conditions and the phytase is isolated from such host cells or the
culture medium by methods known in the art.

27. A food or feed composition comprising a phytase as claimed in any one of claims 12 to 21.

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Figure 3 
Figure 4/1

• • • . . 

I. TCTCTAACCGATACKIGGACCGACTAGGCATCCrri^ 50 

• • • • • 

SL GACAATCCAACTCAGTCGAATATGAACGGCrACAKCAGCAT^ 100 

• • . ■ , 

10 1 GGCCCTCTAGGTCGGCCTCCCCXXJATGAGGAGGACCAGGCTC^ ISO 

IS I TrCGGTCATGG Ll L i lll ' U UrGOTCC ClCt ' JL rCG CTTTATTACTTOTrA 200 
MAFFTVALSLYYLLS 15 

201 CGAG^gagacccccacaacacccgcccgcccagccgaaccggcacccac 250 
R. 16 



2S1 CcgcacagAGTCTCTCCTCWSGCCCCAGTCGTCC^ 300 
VSAQAPVVQNHSCN 30 



301 TACOGCGGACGGTGGATATCAATGCTTCCCCA A TgrCTCTC A ' iU i ' l ' lUG G 350 
TAOGGYQCFPNVSKVWG 47 



3S1 CTCAGTACrCGCCGTAC t IC ILCATCGAGCACGAGTCAGCTATCrCTGAG 400 
QYSPYF5XEQESAI SE 62 



401 CACGTGCCrCATGGCTGTGAGCTTAC Cx i ' iblU CAG G T G CTCrCGCGGCA 450 
DVPHGCEVTFVQVLSRH 80 



451 TGC^jGCTACGTATCCGACAGAGTCGAAGAGTAAGGCGTAC^^ soo 
CARYPTESKSJCAVSCLI 97 



501 TrGAACCAATCCaGAAGAATGCTAC CiCTn - i ' itiG C<^^ 550 

EAtQKMATSFWGQYAF 113 
♦ 

531 CTGGAGAGTr ATAACTATACCCTCGGCGCGGATGACrTGACTATC^^ 600 

LESYNYTLGAOOLTX FG 130 

• • • • • 

601 CGAGAACCACATGGTTCATTCGGGTGCCAA G Il ' C I A CCGACGCTrATAAGA 650 

ENQMVDSGAKFYRRYKN 147 

• > , « • 

63 2. ATCTXrGCCACGAAAAATACTCCTrrrATCCGTGCATCAGGCT^^ 700 

LARKNTPFIRASGSOR 163 
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Figure 4/2 



70L GTCCrrrCCCrrCTGCGGAGAAGrrCATTA^TCXiA^ 7S0 

VVASAEKFXNGFRKAQL 130 

7Sl CCACGACCATGGCrCCAAACCrrGCrACGCCA G ' L ' iCf I ' C AATGTGATTATCC 900 

HDKCSKRATPVVNVIIP 197 



801 CTGAAATCGATGGGrrrAAC^ACACCCTGGACCATACC^ 850 

EIDGrNMTLOHSTCVS 213 

\ 

851 TrrGACPATCXTG^CCGQ^^ 900 

FEbrOERADCIEANPTAI 230 

901 TATGGCWCCTCCCyVTCCGCAAACGTCTGGAAAATGACC^ 950 

MCPPXRfCRLEMOLPGIK 247 



9S1 AACrrACAAJaZGAGAATGTAATATATTTCATGCAT ^ 1000 
LTKEMVZYLMDMCS FD 263 



1001 ACCATTOrcy:GCACCGCCCACGGAAgQ3AGCT^^ lOSO 
TMARTAKGTELSPFCAI 280 



1051 crrcfrCTgfrAAftggAgTCgg^^ noo 

FTEKEWLQTfOYLQSLSK 297 



1101 fiSiaCTACGGCTACGGTGCCGGAAGCCCCCTTGGCCCAG^ 1150 

YrGYCAGS PliGPAQGI 313 

1151 GGCrrCACCAACGAGCrCATtGCCCGACTAACGCAATCGCCCGT 1200 

GFTNEL rARLTQSPVQD 330 

1201 CAACACAAGCACCAACCACACrCTAGACTCGAACCCACCGA^^ 1250 

MTSTMHTLDSNPATFPL 347 



a • • • • 

1251 TCCACAGGAAGCTCTACGCCGA C t I C CCACGACAATAGCATGATATCG 1300 

0RKLYA0F.SKDN,S M^I S 3S3 

• • ■ • • • 

1301 ATATTCITCGCC^TGCGTCTGTACAJKCGCCACCCAGCCGC^ 1350 

IFFAMCLYNGTQPLSKO 330 
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Figure 4/3 



« • • • . 

13SL TTCCGTGGAlTrCGATCCJVGGAGATGGACGC^ 1400 





SVESIQEMDGYAASWTV 


397 


1401 


• • • • « 
TTCCGl t. IGO lGCGAGGGCTTALTI*lXiACCTCATCCAGrGCGAGAAGAAG 
PFGARAYFELMQCEK5C 


413 




* • • • « 
E PLVRVLVNDRVVPLKG 


430 


ISO I 


• • • • • 
CTCCGCAGTTGACAAO LU'LGGACGGTCCACTrTGGACGATrGGGTAGACG 
CAVDKFG RCTLDOMVEC 


1S50 
447 


ISSl 


• • • • • 
GCrrCiAALTTltiCAAaiAaJGGCXWGAACT^^ 

LNFARSGGNWJCTCFTL 


1600 
463 


1€01 


• • • * • 

TAAAGGGCG auucravTTCATAAGixrrrGitx^^ 


1650 


1651 


GGAATTAGClLri 1 li.AXrrrrAClX:VLATrAGACCAAGAAIGAa l^G'/ilG 


1700 


1701 


TTCrCAAbGCL i IL lAGCATATCCTCAACTGGGATAAATCACCTATCCrC 


1750 


1731 


CATGTGTAGGIGAACCCGC I'CnXJCA'iXi&ACC^CVXVIUlTTCAGACTAC 


laoo 


laoi 


TTTCACCAAACATATCCICGIUTCCU'CTCIU'CTUJI^^^^ 


laso 


lasi 


TACAClU'LaX-TClATCTATATCGTOVACAAAACTACCACC^ 


1900 


1901 


• • • 
ATCTCACACirrCCAGCACGAAATTTCTTCC 1931 
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Figure 5/1 

1 TTCCACGCrCAAAGCCTGACTGCGATTTCCAAGCTCCAT^ S 0 

S 1 AACTGCCTGCrrATCrrCATCACACGCAGATACAC^ 100 

• • ■ . • 

101 TGOVCCCATGACGGACGAACGCACCGCTCTCrrGGCCTCCA^ 130 

ISl AGGTCGACXIGCGATGAGGTCGCGCCCrCGACGGCCTCCC^ 200 

• > • ■ ' • 

201 CAGTTGAGATCTCGCTGCGAACGTCGACCGCAGATATGv/i lOtCl ICGAC 250 

251 OU I CC XX:GCCrn:CAGGAACjA Al 'i OC l' GC T G T G ACGATC ^ 10 V 300 

K S L L L 5 

■ • • • • 

. 301 TG ClUGlGC r G TCCCGCGGCTTSGTCGCGTTATAgcacgccccccccccc .330 

LVLSGGLVALY IS 

• ■ ■ • ■ 

351 cggtcacaccgccctccgccaacgcccccacaaccgaagTGTCTCAAGAA 400 

V S R N 20 

401 ATCCGCATCTIGATACCCACTCTtGCAATACAGTX^^ 4 S 0 

PKVDSKSCNTVEGGYQ 36 

■ • • • • 

451 TGTCGTCCAGAAATCrCCCACrrCCTGGGGCCAGTATTCTCCATTC^ 500 

CRPEIS KSWGQYSPFFS S3 

501 CCrGGCACACCACTCGGACATCrCGCCACATGTCCC^^ 550 

LADQSEISPDVPQ NCKI 70 

■ • « • • 
TFVQLLSRKGARYPTS 86 

SQL TCCAAGAa^GACCTGTATTCGCAGCTGATCACIXXGATTC^ S50 

SKTELYSQL tSRtQKTA 103 

• . • • • 

651 GACTCCGTACAAACGCTACr ATGCCTTCrrCAAAG ACTACA^ 700 

TAtKGVYAFLKOYRYQL 120 

7 0 L TGGGAGCGAACGACCrG ACGCCCrrrCGGGAAAACCAG AT^ 750 

GANDLTPFGENQMIQL 136 
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' * • • 

7 S 1 CGC\TCAACTTrTATAACCATTACAACACTCTCCCCACGAA7CCCC^ fl 0 0 

CIKFVNKYICSLARNAVP 153 



• • . . . 

801 ATTCGTTCGTTGCTCCKCrCTCATCGGGTCArrGCCTCGG 830 
FVRCSCSDRVXASGRLr 170 



851 TCATCGAACGrrTCCACAGCGCCAAACTGCTC^SATCCTCATTC^ 900 

iegfqsakvlopksdk: las 

901 CATCACGCTCCTCCCACGATCAACGTGATCATCGAC^^ 950 

HOAPPTtNVirEEGPSV 203 



• • • • • 

951 CAATAACACGCTCGACACCGGCAGCrcrrCCAGTCTTTGAGGAC^ 1000 

MNTLDTGSCPVFEOSSG 220 

*■ 

1001 GGGGACATCACCCACACGAAAACrrCGCAAAGCAATrOT 1050 

GKOAQEKFAICQFAPAI 236 

1051 CTGGAAAAGATCAAGGACCATCTTCCCGGCCTGGACCrGGC CC T C T ^ 1100 

LEKZKOKLPGVDLAVSO 253 



1101 TGTACCGrACrrGATGGAC'l'iU LO l CC Grri - U AGACCTTCGCTCGCAACC 1150 
-VPYLMOLCP FETLARNH 270 



1151 ACACAGACACGCiGrCTCLUl lL il^CGCT C l I IC CACGCAAGAGGACTCG 1200 
TOTLSPFCALSTQESM 286 



" • • • • 

120 1 CAAGCATATGArrACTACrJUVACrrCTGCg^ 1250 

QAYDVYQSLGKYYGNGG 303 

• • • • • 

1251 GGGTAACC CG I ' lGG GGCCACCCCAAGGCCTGGG C/ 1 1 i CAACGAGTrGA 1300 

GNP LC PAQGVG FVtfE L I 320 

• • * • • 

1 3 C 1 TTCCTCGCATCACCCATACC O: ila I ' C CAGGACTACACCACGGTCAACCAC 1350 

ARMTKSPVQDYTTVNK 336 

• • • • • • 

1351 ACTCTTGACTCGAATCCqGCCAC\TTCC <- l l 1.U AACGCGACGCTGTACGC 1400 

TLOSNPATFPLWATLYA 353 
Figure 5/3



• • • • ■ 

1401 ACATTTCAGCCACGACAACACAATXlAaTrCAArrTrCGCG^ 14 SO 

OFSKQNXMTSIFAALGL 370 



• • • • • 

14 SI TGTACAACGGGACCGCGAAGCTCTrcCACGACCGACATCAA^ IS 00 

YNCTAKLSTTHIKSXE 335 



ISOl GAGACGGACGGCTACTCCKSCGGCCrrGGACCCrrrCCCr:^ 1S30 

ETOGYSAAWTVP?GGRX 403 

ISSl CTATATCGACATCATGCACTGTGATGATTCGGATC^^ ISOO 

YIEMMQCDDSDEPVVRV 420 

1601 TGCTGGT C AACGACCGG G TG G TG C C^CrGavr^^ 1630 

LV5I0RVVPLHGCEVDS 436 

• • • « • 

ISSl CTGGGGCGATGCAAACGAGACGA Lllibl(- AGGGGACTGA L.l ' iaiU :^ 1700 

LGRCKRDOFVRGLS FAR 4S3 



1701 ACAGGGTCGGAACTGGGAGGG G "GT ' l A CGCIX X ' I'l Xl t U ACTAGGTTTATT 1750 

QCGMWEGCYAASE • 466 
• • • • ■ 

1751 CACCGA(7iTTCGACCrrrCTATCCrrCAAACACTGC^^ 1800 

1301 CATCAAATGGTAACAGGCCTGGACC tj k i i lA GAACGAAAAAAGTT 1845 
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• • . - • 

I AGATTCAACGACGGAGGAATCGCAACCCTAATrGTCGGTATCA^^ 50 

M V T 3 



• • • • • 

S I TCTGALi. UCCiLr, i I ICGGCGGCGTATCI U ,1;. i CX G Ggcgagcggcc c 100 
LTcLLSAAYLLSG IS 



10 1 ggacccaccgcccggacagggccgcggcgccgaccccgaaacggagTAGA ISO 

R 17 



* • ■ ■ • 

IS I CrCGTCTGCGGCACCTACrrTCrGC^^ 200 
VSAAPSSAGSKSCOTVD 34 



• • • • • 

201 CCTCCGC7rACCAGTCCTCCCCrGCCy«riTCrC^ 250 
LGYQCSPATSHLWGQVS 51 



• ■ • • • 

251 CGCCAi l C IlI IC Cr C iCUAGGACGACCTCTCCGT G TC C ACTAAGCITCCC 300 
PFFSLEDELSVSSKLP 67 



301 AACXSArLCCCGGATCACCrTGGXiW::^^ 3 SO 

KDCRITLVQVLSRKGAR 64 



331 CTACCCAACCACCTCCftAGACCAAAAACTATAAGAAC C i ' lG lU ACGGCGA 400 
YPTSSKSKKYKKLVTAI 101 



• • • • • 

401 TCCAOXXIAATCCCACCXSACITCAACGCC^^ 4 50 

QAHATOFKCKFAFL KT 117 

• • • • • 

431 TACA^CrATACTCTGGGTCCGCATGACCTCA LICCCri IG GGGAGCAGCA 500 

YNYTLCADOLTP-FCEQQ 134 

501 GCTGGTGAACTCGGGCATCAACrrrCTACCAGAG^ 530 

LVNSGIKFYQRYKALAR 151 



531 GCA G TGTGCTGCCGTTrATTCGCGCCTCAGGCTCCGACCGC^ 600 
SVVPFrRASGSORVrA 157 
Figure 6/2



• - . • • 

6 0 L TCaX;ACAGAAGTTCATCGAOGGGTTCCAGCPC<}^ 630 

SGEKFIECcQQAKLADP 134 

• • . . • 

651 TGGCGCGACGAACCGCGCCGCTCCGGCGATTAG rG i a ATTATTCCGGAGA 700 

GATMRAAPAtSVir PES 201 



70 1 CCGAGACGTTCAACAATACGCTGGACCIACCGTCT^^ 7S0 
ETFNNTLDKCVCTKFE 217 



• ■ • • • 

751 GCGAGTCAGCTGGGAGATGACGTTGCGGCCXATTTCAC^ 800 
ASQLGOEVAAMFTALFA 234 



801 ACCCGACATCCGAGCTCGCGCCGACAAGCATC l-rCCC G GCCTGACGCTGA 850 
POIRARAEKKLPCVTLT 251 



851 CAGACGACGAC GTiXiT CACTCTAATGGACAi.Xj L U ri L Oi rm ATACGGTA 900 
DEDVVSLMDMCSFDTV 267 



901 g< ; :gOTCACC/^KgA( ; :<y : ftA(r^^ ^ tctcaa^ l l hujas sso 

ARTSDASQLSPFCQLFT 284 



951 TCAgAATCAGTGGAAGAACTAgAA(-rArrTTCAf7rCC^^ 1000 
H MEWKKY N Y LQSLG KY Y 301 



1001 ACCGCTACGCCGCACCCAACCCTCTGGGACCGCCT^^ 1050 

GYCACNPLCPAQGIGF 317 

lOSl ACCAACGAGCTGATTCCCCGCTTGACTCCTTCGCCA^ 1100 

T er E L r A R L T R S P V Q D K T 334 

1101 CACCACTAACTCGACTCTACTCTCCAACCCGGCCACCT^ 1150 

STMSTL V SfcTPATFPLNA 351 



1151 CTACCATGTACGTCGACv t L lCACACGACAACAGCATGCTTTCCATCTTC 1200 
TMYV O FSKD MSMVS X F 337 



1201 TTTCCAriGGGCCTGTACAACGGCAC^ 1230 
FALGLYNG TEPLSRTS V 384 
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Figure 6/3 



• • . • • 

1251 GGAAACCGCCAAGGAATTGGATGGGTArrCTCCATCC^^ 1300 
ESAKELDClfSASWVV P F 401 



1301 XCGGCCKX^rCAGCCrACrrCGAGACGATGCAATC^^ 1350 

GARAYFETMQCKSEKE 417 

• ■ « • • 

1351 C C ILl i:GI iCGCG Ci L LU VTTAATGACCGG G T TG I G CCACrGCATGGCTG 1400 

PLVRALtMORVVPLKGC 434 



1401 CGATGTGCACAACCrCGGGCCATGCAAGCTGAATGALi l lUiUVACGGAT 1430 
DVOKLGRCKLNDFVKGL 451 



« ■ • • • 

1451 TGAGTTGGCXZCAGATCTGGGGGCAACTGGGGACA G IGC L 1 1 A CTTGAGAT 1500 
SHARSGGHWGECF5 « 4$5 



« • • • ■ 

ISOl GTC AI 10 1 r A TGCrATACTCCAATAGAC a7l^XXU ' lA CCCArrCACT^ 1550 

1531 CTTtGCTCGAACCGCCtGCCC 1571 
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Figure 7/1 



I ACGTCCC«X7rCG3CX;ACTACATCCGCTATGTGCr^ S 0 

SI AACAATATACTGTCT C i LUt GGCTACCATGGGGG ri'l TCG 't C G L\ CTATT 100 

MGVFVVLL a 



10 1 ATCTATCGCGACrCTGTTCGGCAGgcacgcgcaccgccccaggcccaacc ISO 
SIATLFGS 16 



15 1 cgcccgccaaccgacaaacagcacagCACATCGGGCACTGCGCTGGGCCC 200 

TSGTALGP 24 



201 CCGTGGAftATCACACCGACTGCACCrCAiTrCGACCGGGGG^ 250 
RGNKSDCTSVDRGYQCF 41 



2 S 1 TCCCTGAGCTCrCCCVTAAATGGGGTCrCTACGCGCCCT 300 
PEL5KKWGLYAPYPSL 57 



301 CACGATGAArw-rCCG'lTr C C IClX iG ACGTCCCGGATGACTGCCACATCAC 350 
QDESPFPLDVPDDCHIT 74 



3 S 1 CrrrCTGCACGTCCTGGCCCGACATGGAGCGaKTI^^ 400 
PVQVLARHGARSPTDSEC 91 



401 ACACAAAGGCXTATCCCGCGACTATTGCACCCATCCAGAACAAT^ 450 
TKAYAATIAAtOKNAT 107 



451 G CG tT C CCCGGCAAATACGC Cl iLClU AAGTCCTACAATTACrCCATGGG S 0 0 
ALPCKYAFLKSYNYSMG 124 



501 CTCCG AGAACCTGAACCCCTTCGGGCGCAACCAACTGCAAGAT^ 550 
SEMLMPFGRMQLQDLGA 141 



551 CCCACTTCTACCGtCGCTACCACACCCTCACCCGGC^CATCAACCC- L i IL 600 
QFYRRYDTLTRHtNPF 157 



601 GTCCGGGCCCCGGATTCCTGCCGCGTCCACGAATCAGCCGAa^AG^ 630 
VRAADSSRVKESAEKFV 174 
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• . , . . 

6 S L CGAGGCXrrrCCAAAACCCCCGCCJUVGGTOATCCTCACCCC^ 700 

EGFQMARQGD PKAMP HQ 191 

" • • • • 

701 AGCCGTCGCaXrGCGTGGATGTACTCATCCCCGAAGGC^ 7 S 0 

PSPRVDVVrpEGTAYN 207 



• • • • • 

751 AACACGCTCGAGCACiVGCATCrGC^CCGCCrTCGA^ 800 
NTLEKSICTAFEASTVG 224 



* • • • ■ 

801 CGACGCCGCGGCACACAACrrCACTGCC G ' l - G iT CGCCCCGGCGATCGCCA 850 

DAAADNFTAVFAPAIAK 241 

* • • • • 

' 851 AGCGTCTGGAGGCCGATCTGCCCXGCGTCCAGCTGTCCGCCGAC^^ 900 

RLEA OLPGVQLSADOV 257 

* • • • ♦ 

901 GTCAATCTGATOXrCATCTGTCCCTT^^ 9S0 

VelLMAMCPFETVSLTDO 274 



• • » • - 

951 CGCGCACACCCTGTCGgtLt. L' LtlCGACr rCTTTCACCGCCGCCGAGTGGA 1000 
AHTLSPFCDLFTAAEWT 291 



• • • • > 

1001 C(^i^ACAACTXC(rrG(rTCrCC^^^^ 1050 
QYftrVLLSLDKYYGYGG 307 



• • • • • 

10 SI GGCAATCCGCTGGGCCCCGTGCAGGGCXrKWGCTGGCCC^^ 1100 
C^fPI.GPV*QCVGWA^fEL X 324 



• * • • • 

1101 CCCGCGCCrGACCCGCTCCCCCCTCCACGACCACACCn^^ 1150 

ARtiT RS PVKO KT CVN NT 341 

1151 CCCTCGACGCCAACCCGGCCACCTTCCCGCrGAAC 1200 

L DANPATFPLNATLYA '357 



1201 GA <- L L L gCG CACG ACAGTAACCrGCTCTCGAT^ fU, L»JGGCGCTGGCTCT 1250 
OFSHOSMLVSIrWACGL 374 
Figure 7/3



I2SL rrrACAACGGCACC^AGCCCCTCTCGCACArt^^ 1300 
YNGTK PLS QTTV EOrXR 391 



L30L yACGGACGgCTACSCC<:<:CGCCT^ 13S0 
T D G YAAAWTV P FA.ARA 407 



U5L ^XC^TTGXGXTGXTGCAGrcrTCG^ L400 
Y r EMMQCaAElCQPLVRV 424 



1401 GCTGGTCAACGACCOTCTCATKCGCT^^ 1450 
LVMORVMPLKGCAVDNL 441 
                                       34.406  38.129  1.00  27.79  1DIK1869
ATOM   1778  CA   TYR   236   -21.339  33.577  37.188  1.00  27.49  1DIK1870
ATOM   1779  C    TYR   236   -22.024  34.490  36.171  1.00  28.77  1DIK1871
ATOM   1780  O    TYR   236   -22.106  34.165  34.980  1.00  30.28  1DIK1872
ATOM   1781  CB   TYR   236   -22.405  32.741  37.908  1.00  28.51  1DIK1873
ATOM   1782  CG   TYR   236   -21.873  31.589  38.735  1.00  31.65  1DIK1874
ATOM   1783  CD1  TYR   236   -20.517  31.243  38.711  1.00  32.72  1DIK1875
ATOM   1784  CD2  TYR   236   -22.733  30.838  39.546  1.00  33.46  1DIK1876
ATOM   1785  CE1  TYR   236   -20.030  30.181  39.471  1.00  33.24  1DIK1877
ATOM   1786  CE2  TYR   236   -22.258  29.768  40.315  1.00  35.17  1DIK1878
ATOM   1787  CZ   TYR   236   -20.904  29.446  40.271  1.00  39.13  1DIK1879
ATOM   1788  OH   TYR   236   -20.422  28.393  41.025  1.00  43.32  1DIK1880
ATOM   1789  N    LEU   237   -22.513  35.632  36.656  1.00  29.13  1DIK1881
ATOM   1790  CA   LEU   237   -23.179  36.634  35.818  1.00  27.63  1DIK1882
ATOM   1791  C    LEU   237   -22.173  37.237  34.842  1.00  27.17  1DIK1883
ATOM   1792  O    LEU   237   -22.506  37.527  33.699  1.00  30.67  1DIK1884






EP 0 897 010 A2 



FierurB »/2a 



ATOM   1793  CB   LEU   237   -23.813  37.734  36.682  1.00  25.77  1DIK1885
ATOM   1794  CG   LEU   237   -25.081  37.327  37.447  1.00  22.88  1DIK1886
ATOM   1795  CD1  LEU   237   -25.528  38.415  38.421  1.00  24.36  1DIK1887
ATOM   1796  CD2  LEU   237   -26.165  37.036  36.455  1.00  17.71  1DIK1888
ATOM   1797  N    MET   238   -20.940  37.423  35.286  1.00  24.39  1DIK1889
ATOM   1798  CA   MET   238   -19.918  37.948  34.403  1.00  25.37  1DIK1890
ATOM   1799  C    MET   238   -19.575  36.882  33.359  1.00  28.12  1DIK1891
ATOM   1800  O    MET   238   -19.335  37.210  32.195  1.00  31.98  1DIK1892
ATOM   1801  CB   MET   238   -18.684  38.358  35.203  1.00  21.01  1DIK1893
ATOM   1802  CG   MET   238   -18.967  39.504  36.148  1.00  18.56  1DIK1894
ATOM   1803  SD   MET   238   -17.500  40.183  36.915  1.00  26.85  1DIK1895
ATOM   1804  CE   MET   238   -16.964  41.396  35.681  1.00  20.81  1DIK1896
ATOM   1805  N    ASP   239   -19.562  35.608  33.764  1.00  29.27  1DIK1897
ATOM   1806  CA   ASP   239   -19.268  34.497  32.838  1.00  28.12  1DIK1898
ATOM   1807  C    ASP   239   -20.314  34.486  31.727  1.00  26.51  1DIK1899
ATOM   1808  O    ASP   239   -20.003  34.210  30.577  1.00  27.19  1DIK1900
ATOM   1809  CB   ASP   239   -19.340  33.129  33.542  1.00  27.22  1DIK1901
ATOM   1810  CG   ASP   239   -18.144  32.836  34.448  1.00  24.04  1DIK1902
ATOM   1811  OD1  ASP   239   -17.162  33.602  34.461  1.00  22.09  1DIK1903
ATOM   1812  OD2  ASP   239   -18.194  31.807  35.162  1.00  27.85  1DIK1904
ATOM   1813  N    MET   240   -21.557  34.784  32.089  1.00  24.64  1DIK1905
ATOM   1814  CA   MET   240   -22.664  34.800  31.141  1.00  26.17  1DIK1906
ATOM   1815  C    MET   240   -22.509  35.815  30.018  1.00  26.18  1DIK1907
ATOM   1816  O    MET   240   -23.034  35.621  28.924  1.00  25.09  1DIK1908
ATOM   1817  CB   MET   240   -23.984  35.039  31.877  1.00  29.12  1DIK1909
ATOM   1818  CG   MET   240   -24.519  33.809  32.591  1.00  28.59  1DIK1910
ATOM   1819  SD   MET   240   -24.873  32.469  31.404  1.00  34.30  1DIK1911
ATOM   1820  CE   MET   240   -26.354  33.141  30.505  1.00  25.69  1DIK1912
ATOM   1821  N    CYS   241   -21.793  36.899  30.280  1.00  25.30  1DIK1913
ATOM   1822  CA   CYS   241   -21.581  37.910  29.254  1.00  27.89  1DIK1914
ATOM   1823  C    CYS   241   -20.931  37.244  28.032  1.00  25.49  1DIK1915
ATOM   1824  O    CYS   241   -21.348  37.459  26.692  1.00  26.76  1DIK1916
ATOM   1825  CB   CYS   241   -20.710  39.049  29.814  1.00  24.99  1DIK1917
ATOM   1826  SG   CYS   241   -19.856  40.115  28.598  1.00  27.44  1DIK1918
ATOM   1827  N    SER   242   -19.924  36.420  28.291  1.00  25.28  1DIK1919
ATOM   1828  CA   SER   242   -19.197  35.698  27.256  1.00  25.73  1DIK1920
ATOM   1829  C    SER   242   -20.072  34.679  26.507  1.00  26.63  1DIK1921
ATOM   1830  O    SER   242   -20.275  34.786  25.289  1.00  25.57  1DIK1922
ATOM   1831  CB   SER   242   -18.003  34.989  27.897  1.00  27.44  1DIK1923
ATOM   1832  OG   SER   242   -17.294  34.219  26.949  1.00  39.33  1DIK1924
ATOM   1833  N    PHE   243   -20.594  33.697  27.238  1.00  26.85  1DIK1925
ATOM   1834  CA   PHE   243   -21.419  32.642  26.648  1.00  26.10  1DIK1926
ATOM   1835  C    PHE   243   -22.663  33.140  25.941  1.00  27.04  1DIK1927
ATOM   1836  O    PHE   243   -23.021  32.637  24.872  1.00  26.53  1DIK1928
ATOM   1837  CB   PHE   243   -21.818  31.617  27.714  1.00  25.98  1DIK1929
ATOM   1838  CG   PHE   243   -20.655  30.845  28.277  1.00  26.19  1DIK1930
ATOM   1839  CD1  PHE   243   -20.075  29.805  27.549  1.00  23.12  1DIK1931
ATOM   1840  CD2  PHE   243   -20.129  31.168  29.530  1.00  21.56  1DIK1932
ATOM   1841  CE1  PHE   243   -18.978  29.093  28.066  1.00  25.40  1DIK1933
ATOM   1842  CE2  PHE   243   -19.042  30.469  30.052  1.00  22.05  1DIK1934
ATOM   1843  CZ   PHE   243   -18.461  29.427  29.319  1.00  21.32  1DIK1935
ATOM   1844  N    ASP   244   -23.330  34.126  26.529  1.00  27.71  1DIK1936
ATOM   1845  CA   ASP   244   -24.537  34.643  25.907  1.00  30.62  1DIK1937
ATOM   1846  C    ASP   244   -24.235  35.494  24.666  1.00  32.18  1DIK1938
ATOM   1847  O    ASP   244   -25.120  35.741  23.854  1.00  34.05  1DIK1939
ATOM   1848  CB   ASP   244   -25.379  35.428  26.917  1.00  27.50  1DIK1940
ATOM   1849  CG   ASP   244   -26.785  35.723  26.404  1.00  28.82  1DIK1941
ATOM   1850  OD1  ASP   244   -27.531  34.777  26.077  1.00  24.13  1DIK1942
ATOM   1851  OD2  ASP   244   -27.149  36.910  26.323  1.00  30.15  1DIK1943
ATOM   1852  N    THR   245   -22.996  35.944  24.510  1.00  32.39  1DIK1944
ATOM   1853  CA   THR   245   -22.658  36.740  23.343  1.00  33.50  1DIK1945
ATOM   1854  C    THR   245   -22.282  35.850  22.153  1.00  38.87  1DIK1946
ATOM   1855  O    THR   245   -22.911  35.918  21.091  1.00  37.64  1DIK1947
ATOM   1856  CB   THR   245   -21.511  37.706  23.647  1.00  32.72  1DIK1948
ATOM   1857  OG1  THR   245   -21.940  33.652  24.631  1.00  37.51  1DIK1949
ATOM   1858  CG2  THR   245   -21.084  38.446  22.394  1.00  28.33  1DIK1950
ATOM   1859  N    ILE   246   -21.268  35.008  22.335  1.00  40.95  1DIK1951
ATOM   1860  CA   ILE   246   -20.803  34.140  21.262  1.00  43.89  1DIK1952
ATOM   1861  C    ILE   246   -21.590  32.847  21.034  1.00  48.67  1DIK1953
ATOM   1862  O    ILE   246   -21.151  31.992  20.262  1.00  48.67  1DIK1954
ATOM   1863  CB   ILE   246   -19.279  33.826  21.419  1.00  41.80  1DIK1955
ATOM   1864  CG1  ILE   246   -18.988  33.144  22.757  1.00  37.29  1DIK1956
ATOM   1865  CG2  ILE   246   -18.464  35.125  21.329  1.00  43.70  1DIK1957
ATOM   1866  CD1  ILE   246   -17.508  33.098  23.087  1.00  29.59  1DIK1958
ATOM   1867  N    SER   247   -22.747  32.710  21.688  1.00  56.97  1DIK1959
ATOM   1868  CA   SER   247   -23.601  31.523  21.534  1.00  63.90  1DIK1960
ATOM   1869  C    SER   247   -24.145  31.424  20.107  1.00  70.33  1DIK1961
ATOM   1870  O    SER   247   -24.503  30.328  19.662  1.00  73.70  1DIK1962
ATOM   1871  CB   SER   247   -24.787  31.567  22.499  1.00  67.11  1DIK1963
ATOM   1872  OG   SER   247   -25.783  32.483  22.058  1.00  66.63  1DIK1964
ATOM   1873  N    THR   248   -24.213  32.569  19.411  1.00  74.67  1DIK1965
ATOM   1874  CA   THR   248   -24.683  32.671  18.015  1.00  74.94  1DIK1966
ATOM   1875  C    THR   248   -23.546  33.054  17.049  1.00  76.44  1DIK1967
ATOM   1876  O    THR   248   -23.272  34.241  16.808  1.00  77.32  1DIK1968
ATOM   1877  CB   THR   248   -25.910  33.719  17.893  1.00  74.59  1DIK1969
ATOM   1878  OG1  THR   248   -26.917  33.299  18.699  1.00  74.12  1DIK1970
ATOM   1879  CG2  THR   248   -26.262  33.688  16.427  1.00  72.56  1DIK1971
ATOM   1880  N    THR   253   -23.919  38.646  16.591  1.00  49.08  1DIK1972
ATOM   1881  CA   THR   253   -24.110  39.972  15.999  1.00  51.56  1DIK1973
ATOM   1882  C    THR   253   -24.258  41.028  17.100  1.00  49.54  1DIK1974
ATOM   1883  O    THR   253   -23.757  42.146  16.965  1.00  49.39  1DIK1975
ATOM   1884  CB   THR   253   -25.394  40.051  15.103  1.00  52.26  1DIK1976
ATOM   1885  OG1  THR   253   -25.489  38.881  14.282  1.00  61.78  1DIK1977
ATOM   1886  CG2  THR   253   -25.344  41.284  14.169  1.00  51.28  1DIK1978
ATOM   1887  N    LYS   254   -24.941  40.670  18.185  1.00  46.23  1DIK1979
ATOM   1888  CA   LYS   254   -25.170  41.610  19.275  1.00  45.73  1DIK1980
ATOM   1889  C    LYS   254   -24.562  41.222  20.618  1.00  42.99  1DIK1981
ATOM   1890  O    LYS   254   -24.470  40.044  20.939  1.00  45.43  1DIK1982
ATOM   1891  CB   LYS   254   -26.664  41.864  19.410  1.00  46.12  1DIK1983
ATOM   1892  CG   LYS   254   -27.214  42.520  18.157  1.00  51.91  1DIK1984
ATOM   1893  CD   LYS   254   -28.671  42.840  18.273  1.00  56.51  1DIK1985
ATOM   1894  CE   LYS   254   -29.168  43.456  16.987  1.00  60.44  1DIK1986
ATOM   1895  NZ   LYS   254   -30.576  43.918  17.160  1.00  67.71  1DIK1987
ATOM   1896  N    LEU   255   -24.141  42.226  21.387  1.00  38.90  1DIK1988
ATOM   1897  CA   LEU   255   -23.533  42.022  22.705  1.00  33.29  1DIK1989
ATOM   1898  C    LEU   255   -24.584  41.661  23.751  1.00  32.90  1DIK1990
ATOM   1899  O    LEU   255   -25.637  42.268  23.825  1.00  33.07  1DIK1991
ATOM   1900  CB   LEU   255   -22.797  43.286  23.141  1.00  25.06  1DIK1992
ATOM   1901  CG   LEU   255   -21.856  43.213  24.344  1.00  26.05  1DIK1993
ATOM   1902  CD1  LEU   255   -20.707  42.242  24.095  1.00  21.04  1DIK1994
ATOM   1903  CD2  LEU   255   -21.326  44.595  24.606  1.00  22.14  1DIK1995
ATOM   1904  N    SER   256   -24.293  40.647  24.558  1.00  32.93  1DIK1996
ATOM   1905  CA   SER   256   -25.212  40.207  25.596  1.00  31.52  1DIK1997
ATOM   1906  C    SER   256   -25.469  41.293  26.643  1.00  32.01  1DIK1998
ATOM   1907  O    SER   256   -24.574  42.013  27.082  1.00  30.64  1DIK1999
ATOM   1908  CB   SER   256   -24.650  38.965  26.306  1.00  30.76  1DIK2000
ATOM   1909  OG   SER   256   -25.446  38.602  27.430  1.00  27.78  1DIK2001
ATOM   1910  N    PRO   257   -26.766  41.432  27.051  1.00  32.46  1DIK2002
ATOM   1911  CA   PRO   257   -27.131  42.434  28.060  1.00  32.73  1DIK2003
ATOM   1912  C    PRO   257   -26.372  42.191  29.379  1.00  31.79  1DIK2004
ATOM   1913  O    PRO   257   -26.136  43.130  30.138  1.00  35.33  1DIK2005
ATOM   1914  CB   PRO   257   -28.644  42.239  28.209  1.00  32.20  1DIK2006
ATOM   1915  CG   PRO   257   -29.053  41.696  26.861  1.00  31.42  1DIK2007
ATOM   1916  CD   PRO   257   -27.960  40.701  26.587  1.00  30.36  1DIK2008
ATOM   1917  N    PHE   258   -25.984  40.941  29.646  1.00  27.64  1DIK2009
ATOM   1918  CA   PHE   258   -25.221  40.617  30.859  1.00  25.04  1DIK2010
ATOM   1919  C    PHE   258   -23.910  41.407  30.928  1.00  24.80  1DIK2011
ATOM   1920  O    PHE   258   -23.389  41.661  32.011  1.00  22.95  1DIK2012
ATOM   1921  CB   PHE   258   -24.877  39.122  30.911  1.00  25.72  1DIK2013
ATOM   1922  CG   PHE   258   -26.033  38.239  31.253  1.00  24.20  1DIK2014
ATOM   1923  CD1  PHE   258   -26.455  38.105  32.569  1.00  23.98  1DIK2015
ATOM   1924  CD2  PHE   258   -26.712  37.544  30.257  1.00  26.82  1DIK2016
ATOM   1925  CE1  PHE   258   -27.543  37.288  32.890  1.00  23.72  1DIK2017
ATOM   1926  CE2  PHE   258   -27.806  36.721  30.565  1.00  24.56  1DIK2018
ATOM   1927  CZ   PHE   258   -28.220  36.594  31.679  1.00  24.85  1DIK2019
ATOM   1928  N    CYS   259   -23.375  41.790  29.772  1.00  24.43  1DIK2020
ATOM   1929  CA   CYS   259   -22.119  42.536  29.731  1.00  27.30  1DIK2021
ATOM   1930  C    CYS   259   -22.323  43.952  30.207  1.00  29.38  1DIK2022
ATOM   1931  O    CYS   259   -21.420  44.565  30.771  1.00  29.68  1DIK2023
ATOM   1932  CB   CYS   259   -21.564  42.591  28.307  1.00  26.45  1DIK2024
ATOM   1933  SG   CYS   259
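The coordinate listing above appears to follow the layout of PDB ATOM records. A minimal sketch of reading such records and measuring an interatomic distance is given below. It assumes whitespace-separated fields in the order record type, atom serial, atom name, residue name, residue number, x, y, z, occupancy and temperature factor; this field order is an interpretation of the listing, and real PDB files are fixed-column, so a production parser would slice by column instead. The names `Atom`, `parse_atom_line` and `distance` are hypothetical.

```python
import math
from typing import NamedTuple


class Atom(NamedTuple):
    serial: int
    name: str
    res_name: str
    res_seq: int
    x: float
    y: float
    z: float
    occupancy: float
    b_factor: float


def parse_atom_line(line):
    """Parse one whitespace-separated ATOM record (assumed field order, see above)."""
    fields = line.split()
    if len(fields) < 10 or fields[0] != "ATOM":
        raise ValueError(f"not a complete ATOM record: {line!r}")
    return Atom(
        serial=int(fields[1]),
        name=fields[2],
        res_name=fields[3],
        res_seq=int(fields[4]),
        x=float(fields[5]),
        y=float(fields[6]),
        z=float(fields[7]),
        occupancy=float(fields[8]),
        b_factor=float(fields[9]),
    )


def distance(a, b):
    """Euclidean distance between two atoms, in the units of the coordinates."""
    return math.dist((a.x, a.y, a.z), (b.x, b.y, b.z))


# Two consecutive alpha-carbon records taken from the listing.
ca_236 = parse_atom_line("ATOM 1778 CA TYR 236 -21.339 33.577 37.188 1.00 27.49 1DIK1870")
ca_237 = parse_atom_line("ATOM 1790 CA LEU 237 -23.179 36.634 35.818 1.00 27.63 1DIK1882")
d = distance(ca_236, ca_237)
print(f"CA(236)-CA(237) distance: {d:.3f}")
```

The two alpha carbons come out roughly 3.8 Å apart, the spacing expected for neighbouring residues, which is a quick sanity check that the coordinates were read into the right columns.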
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Figure 14a/l 

Primer set A 

primer Q27L s 5' 
primer Q27L as 3' 

Primer set B 

primer Q274L s 5' 
primer Q274L as 3' 

Primer set C 

primer G277D s 5' 
primer G277D as 3' 

Primer set D 

primer G277D* s 5' 
primer G277D* as 3 ' 

Primer set E 

primer N340S s 5' 
primer N340S as 3' 
Primer set F

primer G277K s 5' C CTT CAG TCC TTG A^S AAG TAG TAC GGC TAG 3 
primer G277K as 3' G GAA GTC AGG AAC TTC ATG ATG CCG ATG 5 
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Figure 14a/3 

Primer set I« 

primer S130N s 5' CTG GCG CGC AAT GTG GTG CCG TTT ATT C 3'
primer S130N as 3' GAC CGC GCG TTA CAC CAC GGC AAA TAA G 5'

L A R N130 V V P F I

Primer set H

primer R129L/S130N s 5' GCT CTG GCG CTC AAT GTG GTG CCG TTT ATT C 3'
primer R129L/S130N as 3' CGA GAC CGC GAG TTA CAC CAC GGC AAA TAA G 5'

A L A L129 N130 V V P F I

Primer set N

primer K167G/R168Q s 5' GAC CAT GGC TCC GGA CAA GCT ACG CCA G 3'
primer K167G/R168Q as 3' CTG GTA CCG AGG CCT GTT CGA TGC GGT C 5'

D H G S G167 Q168 A T P
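Each primer set pairs a sense primer (written 5' to 3') with an antisense primer (written 3' to 5'), and the sense strand encodes the amino acids printed beneath it. A minimal sketch of checking both properties follows, using the S130N pair. The mutated codon is read here as AAT (sense) and TTA (antisense), an assumption inferred from the encoded Asn and from base pairing. The function names are hypothetical, and the standard genetic code is assumed.

```python
# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def clean(primer):
    """Drop the spaces used to group codons in the figure."""
    return primer.replace(" ", "").upper()


def complement(sequence):
    """Base-by-base complement, keeping the same left-to-right order."""
    return "".join(COMPLEMENT[base] for base in sequence)


def translate(sequence):
    """Translate complete codons from the first base; a trailing partial codon is ignored."""
    usable = len(sequence) - len(sequence) % 3
    return "".join(CODON_TABLE[sequence[i:i + 3]] for i in range(0, usable, 3))


# S130N primer pair; the antisense primer is written 3' -> 5' as in the figure.
sense = clean("CTG GCG CGC AAT GTG GTG CCG TTT ATT C")
antisense_3to5 = clean("GAC CGC GCG TTA CAC CAC GGC AAA TAA G")

pair_matches = complement(sense) == antisense_3to5
peptide = translate(sense)
```

Because the antisense primer is printed 3' to 5', a plain complement is enough for the comparison; a primer printed 5' to 3' would need to be reversed first.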






Figure 15: 
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Figure 17a: 
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Figure 17b: 
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Figure 26: 
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